Chapter 1

Preliminaries on Modern Economics 
and Mathematics 

In this chapter, we first set out some basic terminology and key assumptions imposed
in modern economics in general and in these lecture notes in particular. We will discuss
the standard analytical framework used in modern economics. We will also discuss
methodologies for studying modern economics, as well as some key points to which one
should pay attention. These methodologies and key points include: providing studying
platforms, establishing reference/benchmark systems, developing analytical tools, noting
the generality and limitations of an economic theory, the role of mathematics,
distinguishing necessary from sufficient conditions for a statement, and conversion
between economic and mathematical language. We will then discuss some basic
mathematical results that we will use in the lecture notes.

1.1 Nature of Modern Economics 

1.1.1 Modern Economics and Economic Theory 

• What is economics about? 

Economics is a social science that studies individuals' economic behavior and economic
phenomena, as well as how individual agents, such as consumers, firms, and government
agencies, make trade-off choices that allocate limited resources among competing uses.

People's desires are unlimited, but resources are limited; therefore, individuals must
make trade-offs. We need economics to study this fundamental conflict and how these
trade-offs are best made.

• Four basic questions must be answered by any economic institution:

(1) What goods and services should be produced and in what quantity?

(2) How should the product be produced? 

(3) For whom should it be produced and how should it be distributed? 

(4) Who makes the decision? 

The answers depend on the economic institution in use. Two basic economic
institutions have so far been used in the real world:

(1) Market economic institution (the price mechanism): Most decisions on
economic activities are made by individuals. This primarily decentralized
decision system is the most important economic institution discovered for
reaching cooperation among individuals and resolving the conflicts that
occur between them. The market economy has so far proven to be the only
economic institution that can sustain development and growth within an
economy.

(2) Planned economic institution: Most decisions on economic activities are
made by governments, in a mainly centralized decision system.

• What is Modern Economics? 

Modern economics, developed mainly in the last sixty years, systematically studies
individuals' economic behavior and economic phenomena by a scientific method
(observation → theory → observation) and through the use of various analytical approaches.

• What is Economic Theory?

An economic theory, which can be considered an axiomatic approach, consists of a set
of assumptions and conditions, an analytical framework, and conclusions (explanations
and/or predictions) that are derived from the assumptions and the analytical framework.

Like any science, economics is concerned with the explanation of observed phenomena
and also makes economic predictions and assessments based on economic theories.
Economic theories are developed to explain the observed phenomena in terms of a set of
basic assumptions and rules.

• Microeconomic theory 

Microeconomic theory aims to model economic activities as the interaction of individual 
economic agents pursuing their private interests. 

1.1.2 Key Assumptions and Desired Properties Commonly Used in Economics

Economists usually make all or some of the following key assumptions and conditions 
when they study economic problems: 

(1) Individuals are (boundedly) rational: the self-interested behavior assumption;

(2) Scarcity of resources: individuals confront scarce resources;

(3) Information about individuals' economic characteristics and actions is
incomplete or asymmetric to a decision maker. As such, decentralized
decision making is desirable;

(4) Economic freedom: voluntary cooperation and voluntary exchange;

(5) Incentive compatibility of parties: the system or economic mechanism
should solve the problem of conflicts of interest among individuals or
economic units;

(6) Well-defined property rights;

(7) Equity in opportunity;

(8) Allocative efficiency of resources.

Relaxing any of these assumptions may result in different conclusions.

1.1.3 The Basic Analytical Framework of Modern Economics

The basic analytical framework for an economic theory consists of five aspects or steps:
(1) specification of economic environments, (2) imposition of behavioral assumptions,
(3) adoption of economic institutional arrangements, (4) determination of equilibria, and
(5) evaluation of outcomes resulting from the institution undertaken. The same framework
can be used to deal with daily activities and to study the particular economic issues
and questions that economists are interested in. Almost all theoretical economics
papers adopt this framework. As such, to obtain solid training in economics and to do
research, it is very important to master this basic analytical framework, specifically
these five steps. It helps in at least three ways: (1) it makes economic theories and
their arguments easier to understand; (2) it helps in finding research topics; and
(3) it helps in writing standard scientific economics papers.

Understanding this basic analytical framework can help people clarify possible
misunderstandings about modern economics. It can also help them use basic economic
principles, or develop new economic theories, to solve economic problems in various
economic environments with different human behavior and institutional arrangements.

1. Specification of Economic Environments 

The first step for studying an economic issue is to specify the economic environment.
The specification of the economic environment can be divided into two levels: (1) description
of the economic environment, and (2) characterization of the economic environment. The
description is a job of science, and the characterization is a job of art. The clearer
and more accurate the description of the economic environment, the more likely the
theoretical conclusions are to be correct. The more refined the characterization of the
economic environment, the simpler and easier the arguments and conclusions are to obtain.

Modern economics provides various perspectives or angles from which to look at real-world
economic issues. An economic phenomenon or issue may be very complicated and be affected
by many factors. Characterizing the economic environment lets us grasp the most
essential factors of the issue and focus our attention on its key characteristics,
so that we can avoid unimportant details. An economic environment
usually consists of (1) a number of individuals, (2) the individuals' characteristics, such
as preferences, technologies, and endowments, (3) informational structures, and (4)
institutional economic environments that include fundamental rules for establishing the basis
for production, exchange, and distribution.

2. Imposition of Behavioral Assumptions

The second step for studying an economic issue is to make assumptions on individuals'
behavior. Making appropriate assumptions is of fundamental importance for obtaining
a valuable economic theory or assessment. A key assumption modern economics makes
about an individual's behavior is that an individual is self-interested. This is a main
difference between individuals and other subjects. The self-interested behavior assumption
is not only reasonable and realistic, but also carries minimal risk. Even if this assumption
does not suit an economic environment, applying it does not cause much trouble for the
economy. A rule of the game designed for self-interested
individuals is likely also suitable for altruists, but the reverse is likely not true.

3. Adoption of Economic Institutional Arrangements

The third step for studying an economic issue is to adopt the economic institutional
arrangements, also called economic mechanisms, which can be regarded as
the rules of the game. Depending on the problem under consideration, an economic
institutional arrangement could be exogenously given or endogenously determined. For
instance, when studying individuals' decisions in the theories of consumers and producers,
one implicitly assumes that the mechanism undertaken is a competitive market mechanism
and takes it as given. However, when considering the choice of economic institutions and
arguing the optimality of the market mechanism, the market institution is endogenously
determined. The alternative mechanisms that are designed to solve the problem of market
failure are also endogenously determined. Economic arrangements should be designed
differently for different economic environments and behavioral assumptions.

4. Determination of Equilibria

The fourth step for studying an economic issue is to make trade-off choices and determine
the "best" one. Given an economic environment, an institutional arrangement, and
other constraints, such as technical, resource, and budget constraints, individuals will
react, based on their incentives and own behavior, and choose an outcome from among
the available or feasible outcomes. Such a state is called an equilibrium and the outcome an
equilibrium outcome. This is the most general definition of an economic "equilibrium".

5. Evaluations 

The fifth step in studying an economic issue is to evaluate outcomes resulting from the
institutional arrangement undertaken and to make value judgments of the chosen
equilibrium outcome and economic mechanism based on certain criteria. The most important
criterion adopted in modern economics is the notion of efficiency or the "first best". If
an outcome is not efficient, there is room for improvement. Other criteria include
equity, fairness, incentive-compatibility, informational efficiency, and operation costs for
running an economic mechanism.

In summary, in studying an economic issue, one should start by specifying economic
environments and then study how individuals interact, under their self-interested
motivation, within an exogenously given or endogenously determined mechanism.
Economists usually use "equilibrium," "efficiency," "information," and
"incentive-compatibility" as focal points, investigate the effects of various economic mechanisms
on the behavior of agents and economic units, show how individuals reach equilibria, and
evaluate the status at equilibrium. Analyzing an economic problem with this basic
analytical framework not only is consistent in methodology but can also yield surprising
(but logically consistent) conclusions.
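The five steps can be traced in a toy market. The following is only a minimal sketch under assumed functional forms: the linear demand and supply curves, their parameter values, and all function names are hypothetical and chosen for illustration, not taken from the notes.

```python
# Toy illustration of the five-step framework (all forms and numbers are assumptions).

# Step 1 (environment): linear demand q = a - b*p and linear supply q = c*p.
# Step 2 (behavior): self-interested, price-taking buyers and sellers, whose
#                    choices are summarized by the two curves below.
def demand(p, a=12.0, b=1.0):
    return max(a - b * p, 0.0)

def supply(p, c=2.0):
    return c * p

# Step 3 (institution): a competitive market, taken here as exogenously given.
# Step 4 (equilibrium): the price at which excess demand is zero, found by
#                       bisection because excess demand is decreasing in p.
def equilibrium_price(lo=0.0, hi=12.0, tol=1e-10):
    while hi - lo > tol:
        mid = (lo + hi) / 2
        if demand(mid) - supply(mid) > 0:
            lo = mid   # excess demand: the price must rise
        else:
            hi = mid   # excess supply: the price must fall
    return (lo + hi) / 2

# Step 5 (evaluation): total surplus from trading q units, i.e. the area between
#                      inverse demand (a - q)/b and inverse supply q/c.
def total_surplus(q, a=12.0, b=1.0, c=2.0):
    return (a / b) * q - q * q / (2 * b) - q * q / (2 * c)

p_star = equilibrium_price()
q_star = demand(p_star)
print(round(p_star, 6), round(q_star, 6), round(total_surplus(q_star), 6))
```

Under these assumed curves, the surplus at the equilibrium quantity exceeds the surplus at any other traded quantity, which is the sense in which the outcome is evaluated as efficient in step 5.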

1.1.4 Methodologies for Studying Modern Economics 

As discussed above, any economic theory usually consists of five aspects. Discussion of
these five steps naturally leads to the question of how to combine them organically.
To do so, economists usually integrate various studying methods into their analysis. Two
methods used in modern economics are providing studying platforms at various levels
and for various aspects, and establishing reference/benchmark systems.

Studying Platform 

A studying platform in modern economics consists of some basic economic theories or
principles. It provides a basis for extending the existing theories and analyzing deeper
economic issues. Examples of studying platforms are:

(1) Consumer and producer theories provide a bedrock platform for studying 
individuals' independent decision choices. 

(2) The general equilibrium theory is based on the theories of consumers and 
producers and is a higher level platform. It provides a basis for studying 
interactions of individuals within a market institution and how the market 
equilibrium is reached in each market. 

(3) The mechanism design theory provides an even higher level of studying
platform and can be used to study or design an economic institution. It
can be used to compare various economic institutions or mechanisms, as
well as to identify which one may be "optimal".

Reference Systems/Benchmark 

Modern economics provides various reference/benchmark systems for comparison and for
seeing how far the real world is from an ideal state. A reference system is a standard economic
model/theory that yields desired or ideal results, such as efficiency (the "first best").
The importance of a reference system does not depend on whether it describes the real
world correctly or precisely; rather, it gives a criterion for understanding the real world.
It is a mirror that lets us see the distance between various theoretical models or realistic
economic mechanisms and the one given by the reference system. For instance, the general
equilibrium theory we will study in the notes is such a reference system. With this
reference system, we can study and compare equilibrium outcomes under various market
structures with the ideal case of the perfectly competitive mechanism. Another example is
the first-best result in a complete-information economic environment in information
economics. Other examples include the Coase Theorem in property rights theory and
economic law, and the Modigliani-Miller Theorem in corporate finance theory.

Although the economic theories or models used as reference systems may impose
some unrealistic assumptions, they are still very useful and can be used for further
analysis. They establish criteria for evaluating various theoretical models or economic
mechanisms used in the real world. A reference system is not required to predict the real
world well, and in most cases it does not need to; it is used to provide a
benchmark for seeing how far reality is from the ideal state given by the reference system.
The value of a reference system is not that it can directly explain the world, but that
it provides a benchmark for developing new theories to explain the world. In fact, the
establishment of a reference system is very important for any scientific subject, including
modern economics. Anyone can talk about an economic issue, but the main difference is
that a person with systematic training in modern economics has a few reference systems
in mind, while a person without such training does not and so cannot
grasp the essential parts of the issue or provide deep analysis and insights.

Analytical Tools 

Modern economics also provides various powerful analytical tools that are usually given
by geometrical or mathematical models. Such tools help us analyze
complicated economic behavior and phenomena through a simple diagram or mathematical
structure in a model. Examples include (1) the demand-supply curve model, (2)
Samuelson's overlapping generations model, (3) the principal-agent model, and (4) the
game-theoretic model.

1.1.5 Roles, Generality, and Limitation of Economic Theory

Roles of Economic Theory

An economic theory has three possible roles: (1) It can be used to explain economic
behavior and economic phenomena in the real world. (2) It can make scientific predictions
or deductions about possible outcomes and consequences of adopted economic mechanisms
when economic environments and individuals' behavior are appropriately described. (3)
It can be used to refute faulty goals or projects before they are actually undertaken. If a
conclusion is not possible in theory, then it is not possible in a real-world setting, as long
as the assumptions approximate reality.

Generality of Economic Theory 

An economic theory is based on assumptions imposed on economic environments,
individuals' behavior, and economic institutions. The more general these assumptions are, the
more powerful, useful, or meaningful the theory that comes from them is. The general
equilibrium theory is considered such a theory.

Limitation of Economic Theory 

When examining the generality of an economic theory, one should realize that any theory or
assumption has a boundary, limitations, and an applicable range. Thus,
two common misunderstandings about economic theory should be avoided. One
misunderstanding is to over-evaluate the role of an economic theory. Every theory is based on
some imposed assumptions. Therefore, it is important to keep in mind that no theory
is universal or can explain everything; each has its limitations and boundary of
suitability. When applying a theory to draw an economic conclusion or discuss an economic
problem, it is important to note the boundary, limitations, and applicable range of the
theory. It cannot be applied arbitrarily, or a wrong conclusion will result.

The other misunderstanding is to under-evaluate the role of an economic theory. Some 
people consider an economic theory useless because they think assumptions imposed in 
the theory are unrealistic. In fact, no theory, whether in economics, physics, or any 
other science, is perfectly correct. The validity of a theory depends on whether or not it 
succeeds in explaining and predicting the set of phenomena that it is intended to explain 
and predict. Theories, therefore, are continually tested against observations. As a result 
of this testing, they are often modified, refined, and even discarded. 

The process of testing and refining theories is central to the development of modern
economics as a science. One example is the assumption of perfect competition. In reality,
no competition is perfect. Real-world markets seldom achieve this ideal state. The
question is then not whether any particular market is perfectly competitive; almost
none is. The appropriate question is to what degree models of perfect competition can
generate insights about real-world markets. We think this assumption is approximately
correct in certain situations. Just like frictionless models in physics, such as free fall
(no air resistance), ideal gases (molecules do not interact), and ideal fluids,
frictionless models of perfect competition generate useful insights in the economic world.

It is often heard that someone claims to have toppled an existing theory or
conclusion, or that it has been overthrown, when some condition or assumption behind
it is criticized. This is usually a needless claim, because any formal rigorous theory can
be criticized at any time: no assumption can coincide fully with reality or cover
everything. So, as long as there are no logical errors or inconsistencies in a theory, we cannot
say that the theory is wrong. We can only criticize it for being too limited or unrealistic.
What economists should do is weaken or relax the assumptions and obtain new theories
based on old theories. We cannot say, though, that the new theory topples the old one,
but instead that the new theory extends the old theory to cover more general situations
and different economic environments.

1.1.6 Roles of Mathematics in Modern Economics 

Mathematics has become an important tool in modern economics. Almost every field 
in modern economics uses mathematics and statistics. The mathematical approach to 
economic analysis is used when economists make use of mathematical symbols in the 
statement of a problem and also draw upon known mathematical theorems to aid in 
reasoning. It is not difficult to understand why the mathematical approach has become 
a dominant approach since finding the boundary of a theory, developing an analytical 
framework of a theory, establishing reference systems, and providing analytical tools all 
need mathematics. If we apply a theoretical result to the real world without knowing the
boundary of the theory, we may get very bad consequences and seriously hurt an economy.

Some of the advantages of using mathematics are that (1) the "language" used and the
descriptions of assumptions are clearer, more accurate, and more precise, (2) the logical
process of analysis is more rigorous and clearly sets the boundaries and limitations of a
statement, (3) it can give a new result that may not be easily obtained through observation
alone, and (4) it can reduce unnecessary debates and improve or extend existing results.

It should be remarked that, although mathematics is of critical importance in modern
economics, economics is not mathematics. Economics uses mathematics as a tool in order
to model and analyze various economic problems. Statistics and econometrics are used to
test or measure the accuracy of our predictions and to identify causalities among economic
variables.

1.1.7 Conversion between Economic and Mathematical Languages 

The result of economics research is an economic conclusion. A valuable economics paper
usually consists of three parts: (1) It raises important economic questions, and answering
these questions becomes the objective of the paper. (2) It establishes the economic models
and draws and proves the conclusions obtained from the models. (3) It uses non-technical
language to explain the results and, if relevant, provides policy suggestions.

Thus, the production of an economic conclusion usually goes through three stages. Stage 1
(non-mathematical language stage): produce preliminary outputs by proposing economic ideas,
intuitions, and conjectures. Stage 2 (mathematical language stage): produce intermediate
outputs by giving a formal and rigorous result through mathematical modeling that
specifies the boundary of a theory. Stage 3 (non-technical language stage): produce final
outputs, namely conclusions, insights, and statements that can be understood by non-specialists.

1.1.8 Distinguishing between Necessary and Sufficient Conditions
as well as Normative and Positive Statements

When discussing an economic issue, it is very important to distinguish between: (1) two
types of conditions: necessary and sufficient conditions for a statement to be true, and (2)
two types of statements: positive analysis and normative analysis. It is easy to confuse
necessary conditions with sufficient conditions, a problem that
often results in incorrect conclusions.

For instance, it is often heard that the market institution should not be used because
some countries are market economies but remain poor. This reasoning leads to a wrong
conclusion because it overlooks that the adoption of a market
mechanism is just a necessary condition for a country to be rich, not a sufficient
condition. Becoming a rich country also depends on other factors such as the political system,
social infrastructure, and culture. Additionally, no example has so far been found of a
country that is rich in the long run but is not a market economy.

A positive statement states facts, while a normative statement gives opinions or value
judgments. Distinguishing these two kinds of statements can avoid many unnecessary debates.

1.2 Language and Methods of Mathematics 

This section reviews some basic mathematical results, such as continuity and concavity of
functions, the Separating Hyperplane Theorem, optimization, correspondences (point-to-set
mappings), fixed point theorems, the KKM lemma, and the maximum theorem, which will be
used to prove some results in the lecture notes. For good references on the material
discussed in this section, see the appendices in Hildenbrand and Kirman (1988), Mas-Colell
(1995), and Varian (1992).

1.2.1 Functions 

Let $X$ and $Y$ be two subsets of Euclidean spaces. In this text, the vector inequalities
$\geqq$, $\geq$, and $>$ are defined as follows: Let $a, b \in \mathbb{R}^n$. Then $a \geqq b$
means $a_s \geq b_s$ for all $s = 1, \ldots, n$; $a \geq b$ means $a \geqq b$ but $a \neq b$;
$a > b$ means $a_s > b_s$ for all $s = 1, \ldots, n$.
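The three vector orderings differ only in how many coordinates must be strictly larger. The following is a minimal sketch; the function names are hypothetical and chosen only for this illustration.

```python
def weakly_ge(a, b):
    """a ≧ b: every coordinate of a is at least the matching coordinate of b."""
    return len(a) == len(b) and all(x >= y for x, y in zip(a, b))

def semi_gt(a, b):
    """a ≥ b: a ≧ b holds and the two vectors are not equal."""
    return weakly_ge(a, b) and tuple(a) != tuple(b)

def strictly_gt(a, b):
    """a > b: every coordinate of a is strictly larger."""
    return len(a) == len(b) and all(x > y for x, y in zip(a, b))

# (1, 3) versus (1, 2): weakly larger and not equal, but not strictly larger
# because the first coordinates tie.
print(weakly_ge((1, 3), (1, 2)), semi_gt((1, 3), (1, 2)), strictly_gt((1, 3), (1, 2)))
```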

Definition 1.2.1 A function $f : X \to \mathbb{R}$ is said to be continuous at a point
$x_0 \in X$ if

$$\lim_{x \to x_0} f(x) = f(x_0),$$

or equivalently, for any $\epsilon > 0$, there is a $\delta > 0$ such that for any $x \in X$
satisfying $|x - x_0| < \delta$, we have

$$|f(x) - f(x_0)| < \epsilon.$$

A function $f : X \to \mathbb{R}$ is said to be continuous on $X$ if $f$ is continuous at every
point $x \in X$.

The idea of continuity is pretty straightforward: there is no disconnected point if we
draw the function as a curve. A function is continuous if "small" changes in $x$ produce
"small" changes in $f(x)$.

The so-called upper semi-continuity and lower semi-continuity are weaker
than continuity. Even weaker conditions are the transfer continuities, which
characterize many optimization problems and can be found in Tian (1992, 1993, 1994),
Tian and Zhou (1995), and Zhou and Tian (1992).

Definition 1.2.2 A function $f : X \to \mathbb{R}$ is said to be upper semi-continuous at a
point $x_0 \in X$ if

$$\limsup_{x \to x_0} f(x) \leq f(x_0),$$

or equivalently, for any $\epsilon > 0$, there is a $\delta > 0$ such that for any $x \in X$
satisfying $|x - x_0| < \delta$, we have

$$f(x) < f(x_0) + \epsilon.$$

Although these characterizations of upper semi-continuity at $x_0$ are equivalent, the
second one is easier to verify.

A function $f : X \to \mathbb{R}$ is said to be upper semi-continuous on $X$ if $f$ is upper
semi-continuous at every point $x \in X$.

Definition 1.2.3 A function $f : X \to \mathbb{R}$ is said to be lower semi-continuous on $X$
if $-f$ is upper semi-continuous.

It is clear that a function $f : X \to \mathbb{R}$ is continuous on $X$ if and only if it is both
upper and lower semi-continuous, or equivalently, for all $x \in X$, the upper contour set
$U(x) = \{x' \in X : f(x') \geq f(x)\}$ and the lower contour set
$L(x) = \{x' \in X : f(x') \leq f(x)\}$ are closed subsets of $X$.
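As a small worked illustration of the contour-set criterion (the step function below is an assumption chosen for this example and does not appear in the notes), consider a function that jumps upward at zero and takes the higher value there:

```latex
\[
f(x) =
\begin{cases}
1, & x \geq 0, \\
0, & x < 0,
\end{cases}
\qquad x \in \mathbb{R}.
\]
% Upper contour sets: closed in both cases, so f is upper semi-continuous.
\[
U(x) =
\begin{cases}
[0, \infty), & x \geq 0, \\
\mathbb{R}, & x < 0.
\end{cases}
\]
% Lower contour set for x < 0: an open half-line, which is not closed,
% so f is not lower semi-continuous and hence not continuous at 0.
\[
L(x) = \{x' : f(x') \leq 0\} = (-\infty, 0) \quad \text{for } x < 0.
\]
```

Taking the value 0 at the jump point instead would reverse the roles: the function would then be lower but not upper semi-continuous.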

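Let $f$ be a function on $\mathbb{R}^k$ with continuous partial derivatives. We define the
gradient of $f$ to be the vector

$$Df(x) = \left[ \frac{\partial f(x)}{\partial x_1}, \frac{\partial f(x)}{\partial x_2}, \ldots, \frac{\partial f(x)}{\partial x_k} \right].$$
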
Suppose $f$ has continuous second-order partial derivatives. We define the Hessian of
$f$ at $x$ to be the $k \times k$ matrix denoted by $D^2 f(x)$ as

$$D^2 f(x) = \left[ \frac{\partial^2 f(x)}{\partial x_i \partial x_j} \right],$$

which is symmetric since

$$\frac{\partial^2 f(x)}{\partial x_i \partial x_j} = \frac{\partial^2 f(x)}{\partial x_j \partial x_i}.$$

Definition 1.2.4 A function $f : X \to \mathbb{R}$ is said to be homogeneous of degree $k$ if

$$f(tx) = t^k f(x).$$

An important result concerning homogeneous functions is the following:

Theorem 1.2.1 (Euler's Theorem) A function $f : \mathbb{R}^n \to \mathbb{R}$ is homogeneous of
degree $k$ if and only if

$$k f(x) = \sum_{i=1}^{n} \frac{\partial f(x)}{\partial x_i} x_i.$$

1.2.2 Separating Hyperplane Theorem

A set $X \subset \mathbb{R}^n$ is said to be compact if it is bounded and closed. A set $X$ is
said to be convex if for any two points $x, x' \in X$, the point $tx + (1 - t)x' \in X$ for all
$0 \leq t \leq 1$. Geometrically, convexity means that every point on the line segment joining
any two points in the set is also in the set.

Theorem 1.2.2 (Separating Hyperplane Theorem) Suppose that $A, B \subset \mathbb{R}^m$ are
convex and $A \cap B = \emptyset$. Then, there is a vector $p \in \mathbb{R}^m$ with $p \neq 0$,
and a value $c \in \mathbb{R}$ such that

$$px \leq c \leq py \quad \forall x \in A \text{ and } y \in B.$$

Furthermore, suppose that $B \subset \mathbb{R}^m$ is convex and closed, $A \subset \mathbb{R}^m$ is
convex and compact, and $A \cap B = \emptyset$. Then, there is a vector $p \in \mathbb{R}^m$ with
$p \neq 0$, and a value $c \in \mathbb{R}$ such that

$$px < c < py \quad \forall x \in A \text{ and } y \in B.$$

1.2.3 Concave and Convex Functions

Concave, convex, and quasi-concave functions arise frequently in microeconomics and
have strong economic meanings. They also have a special role in optimization problems.

Definition 1.2.5 Let $X$ be a convex set. A function $f : X \to \mathbb{R}$ is said to be concave
on $X$ if for any $x, x' \in X$ and any $t$ with $0 \leq t \leq 1$, we have

$$f(tx + (1 - t)x') \geq t f(x) + (1 - t) f(x').$$

The function $f$ is said to be strictly concave on $X$ if

$$f(tx + (1 - t)x') > t f(x) + (1 - t) f(x')$$

for all $x \neq x' \in X$ and $0 < t < 1$.

A function $f : X \to \mathbb{R}$ is said to be (strictly) convex on $X$ if $-f$ is (strictly)
concave on $X$.
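Homogeneity, Euler's identity, and the concavity inequality can all be checked numerically at sample points. The sketch below uses an assumed Cobb-Douglas function with exponents summing to one, so it is homogeneous of degree 1 and concave on the positive orthant. The function, the exponent value, the sample points, and all names are hypothetical, and a check at finitely many points illustrates the definitions without proving anything.

```python
import itertools

ALPHA = 0.3  # assumed exponent; f is homogeneous of degree 1 for any ALPHA in (0, 1)

def f(x1, x2):
    """Cobb-Douglas function x1^ALPHA * x2^(1 - ALPHA) on the positive orthant."""
    return x1 ** ALPHA * x2 ** (1 - ALPHA)

def grad(x1, x2):
    """Analytic partial derivatives of f."""
    return (ALPHA * f(x1, x2) / x1, (1 - ALPHA) * f(x1, x2) / x2)

def euler_gap(x1, x2, k=1):
    """Absolute difference between k*f(x) and the sum of (df/dx_i) * x_i."""
    g1, g2 = grad(x1, x2)
    return abs(k * f(x1, x2) - (g1 * x1 + g2 * x2))

def concavity_holds(x, y, t, tol=1e-12):
    """Check f(t*x + (1-t)*y) >= t*f(x) + (1-t)*f(y) up to rounding error."""
    mix = (t * x[0] + (1 - t) * y[0], t * x[1] + (1 - t) * y[1])
    return f(*mix) >= t * f(*x) + (1 - t) * f(*y) - tol

points = [(1.0, 1.0), (2.0, 0.5), (0.5, 4.0), (3.0, 3.0)]

homogeneous = all(abs(f(2 * a, 2 * b) - 2 * f(a, b)) < 1e-12 for a, b in points)
euler_ok = all(euler_gap(a, b) < 1e-12 for a, b in points)
concave_ok = all(
    concavity_holds(x, y, t)
    for x, y in itertools.product(points, repeat=2)
    for t in (0.25, 0.5, 0.75)
)
print(homogeneous, euler_ok, concave_ok)
```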

Remark 1.2.1 A linear function is both concave and convex. The sum of two concave 
(convex) functions is a concave (convex) function. 

Remark 1.2.2 When a function $f$ defined on a convex set $X$ has continuous second
partial derivatives, it is concave (convex) if and only if the Hessian matrix $D^2 f(x)$ is
negative (positive) semi-definite on $X$. It is strictly concave (strictly convex) if the
Hessian matrix $D^2 f(x)$ is negative (positive) definite on $X$.

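Remark 1.2.3 The strict concavity of $f(x)$ can be checked by verifying that the leading
principal minors of the Hessian alternate in sign, i.e.,

$$f_{11} < 0, \qquad
\begin{vmatrix} f_{11} & f_{12} \\ f_{21} & f_{22} \end{vmatrix} > 0, \qquad
\begin{vmatrix} f_{11} & f_{12} & f_{13} \\ f_{21} & f_{22} & f_{23} \\ f_{31} & f_{32} & f_{33} \end{vmatrix} < 0,$$

and so on, where $f_{ij} = \frac{\partial^2 f}{\partial x_i \partial x_j}$. This algebraic
condition is useful for checking second-order conditions.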
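As a worked illustration of the leading-principal-minor test (the quadratic function below is an assumption chosen for this example), the minors can be computed directly:

```latex
\[
f(x_1, x_2) = -x_1^2 + x_1 x_2 - x_2^2,
\qquad
D^2 f(x) =
\begin{bmatrix}
-2 & 1 \\
1 & -2
\end{bmatrix}.
\]
% First leading principal minor: negative.
\[
f_{11} = -2 < 0,
\]
% Second leading principal minor: positive.
\[
\begin{vmatrix}
-2 & 1 \\
1 & -2
\end{vmatrix}
= (-2)(-2) - (1)(1) = 3 > 0.
\]
% The minors alternate in sign starting from a negative one, so the Hessian is
% negative definite everywhere and f is strictly concave.
```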

In economic theory quasi-concave functions are used frequently, especially for the
representation of utility functions. Quasi-concavity is somewhat weaker than concavity.

Definition 1.2.6 Let $X$ be a convex set. A function $f : X \to \mathbb{R}$ is said to be
quasi-concave on $X$ if the set

$$\{x \in X : f(x) \geq c\}$$

is convex for all real numbers $c$. It is strictly quasi-concave on $X$ if

$$\{x \in X : f(x) > c\}$$

is convex for all real numbers $c$.
A function $f : X \to \mathbb{R}$ is said to be (strictly) quasi-convex on $X$ if $-f$ is
(strictly) quasi-concave on $X$.

Remark 1.2.4 The sum of two quasi-concave functions in general is not a quasi-concave
function. Any monotonic function defined on a subset of the one-dimensional real space
is both quasi-concave and quasi-convex.

Remark 1.2.5 When a function $f$ defined on a convex set $X$ has continuous second
partial derivatives, it is strictly quasi-concave if the naturally ordered principal
minors of the bordered Hessian matrix $H(x)$ alternate in sign, i.e.,

$$\begin{vmatrix} 0 & f_1 & f_2 \\ f_1 & f_{11} & f_{12} \\ f_2 & f_{21} & f_{22} \end{vmatrix} > 0, \qquad
\begin{vmatrix} 0 & f_1 & f_2 & f_3 \\ f_1 & f_{11} & f_{12} & f_{13} \\ f_2 & f_{21} & f_{22} & f_{23} \\ f_3 & f_{31} & f_{32} & f_{33} \end{vmatrix} < 0,$$

and so on.
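As a worked illustration of the bordered Hessian test (the function below is an assumption chosen for this example), take a product of two goods on the strictly positive orthant:

```latex
\[
f(x_1, x_2) = x_1 x_2, \qquad x_1 > 0, \; x_2 > 0,
\]
% First and second partial derivatives.
\[
f_1 = x_2, \quad f_2 = x_1, \quad f_{11} = f_{22} = 0, \quad f_{12} = f_{21} = 1.
\]
% Bordered Hessian determinant, expanded along the first row.
\[
\begin{vmatrix}
0 & x_2 & x_1 \\
x_2 & 0 & 1 \\
x_1 & 1 & 0
\end{vmatrix}
= -x_2 (0 - x_1) + x_1 (x_2 - 0) = 2 x_1 x_2 > 0.
\]
% The sign condition for two variables holds, so f is strictly quasi-concave here,
% even though its ordinary Hessian has determinant -1 < 0 and f is not concave.
```

This also shows concretely that quasi-concavity is weaker than concavity.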

1.2.4 Optimization 

Optimization is a fundamental tool for the development of modern microeconomic
analysis. Most economic models are based on the solution of optimization problems.
Results of this subsection are used throughout the text.

The basic optimization problem is that of maximizing or minimizing a function on 
some set. The basic and central result is the existence theorem of Weierstrass. 

Theorem 1.2.3 (Weierstrass Theorem) Any upper (lower) semi-continuous function
reaches its maximum (minimum) on a compact set, and the set of maximizers (minimizers)
is compact.
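
To see why compactness matters, a brief illustration follows (the function and the two domains are assumptions chosen for this example):

```latex
% On a set that is bounded but not closed, the supremum need not be attained.
\[
f(x) = x \ \text{on} \ [0, 1): \qquad \sup_{x \in [0,1)} f(x) = 1, \ \text{but} \ f(x) < 1 \ \text{for every} \ x \in [0, 1).
\]
% On the compact set [0, 1], the same continuous function attains its maximum.
\[
f(x) = x \ \text{on} \ [0, 1]: \qquad \max_{x \in [0,1]} f(x) = f(1) = 1.
\]
```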

EQUALITY CONSTRAINED OPTIMIZATION

An optimization problem with equality constraints has the form

$$\begin{aligned}
\max \quad & f(x) \\
\text{such that} \quad & h_1(x) = d_1 \\
& h_2(x) = d_2 \\
& \quad \vdots \\
& h_k(x) = d_k,
\end{aligned}$$

where $f, h_1, \ldots, h_k$ are differentiable functions defined on $\mathbb{R}^n$, $k < n$, and
$d_1, \ldots, d_k$ are constants.

The most important result for constrained optimization problems is the Lagrange
multiplier theorem, which gives necessary conditions for a point to be a solution.
Define the Lagrange function:

$$L(x, \lambda) = f(x) + \sum_{i=1}^{k} \lambda_i [d_i - h_i(x)],$$

where $\lambda_1, \ldots, \lambda_k$ are called the Lagrange multipliers.

A necessary condition for x to solve the maximization problem is that there exist
$\lambda_1, \ldots, \lambda_k$ such that the first-order conditions (FOC) hold:

$$\frac{\partial L(x, \lambda)}{\partial x_i} = \frac{\partial f(x)}{\partial x_i} - \sum_{l=1}^{k} \lambda_l \frac{\partial h_l(x)}{\partial x_i} = 0, \qquad i = 1, \ldots, n.$$
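The first-order conditions can be checked on a small worked case. The sketch below assumes the illustrative problem of maximizing f(x1, x2) = x1 x2 subject to x1 + x2 = d with d = 2; the problem and all variable names are hypothetical and chosen only because the conditions can be solved by hand (x2 = lambda, x1 = lambda, x1 + x2 = d).

```python
def f(x1, x2):
    """Illustrative objective (an assumption, not from the notes)."""
    return x1 * x2


d = 2.0

# Solving the FOC by hand: df/dx1 = x2 = lam, df/dx2 = x1 = lam, x1 + x2 = d.
lam = d / 2.0
x_star = (lam, lam)

# Residuals of the two first-order conditions and of the constraint.
foc_residuals = (
    x_star[1] - lam * 1.0,          # df/dx1 - lam * dh/dx1
    x_star[0] - lam * 1.0,          # df/dx2 - lam * dh/dx2
    d - (x_star[0] + x_star[1]),    # constraint h(x) = d
)

# Cross-check: scan points on the constraint line and keep the best one.
grid = [d * i / 1000 for i in range(1001)]
best_value, best_x1 = max((f(t, d - t), t) for t in grid)

print(foc_residuals, best_value, best_x1)
```

The grid scan is only a sanity check that the candidate from the first-order conditions is in fact the maximizer on the constraint set; the conditions themselves are necessary, not sufficient.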

INEQUALITY CONSTRAINED OPTIMIZATION 

Consider an optimization problem with inequality constraints:

$$
\begin{aligned}
\max\ & f(x) \\
\text{such that}\ & g_i(x) \leq d_i, \qquad i = 1, 2, \ldots, k.
\end{aligned}
$$

A point x at which all constraints hold with equality (i.e., $g_i(x) = d_i$ for all i) is said to
satisfy the constraint qualification condition if the gradient vectors $Dg_1(x), Dg_2(x), \ldots, Dg_k(x)$
are linearly independent.

Theorem 1.2.4 (Kuhn-Tucker Theorem) Suppose x solves the inequality constrained
optimization problem and satisfies the constraint qualification condition. Then there is
a set of Kuhn-Tucker multipliers $\lambda_i \geq 0$, $i = 1, \ldots, k$, such that

$$Df(x) = \sum_{i=1}^{k} \lambda_i Dg_i(x).$$

Furthermore, we have the complementary slackness conditions:

$$
\begin{aligned}
\lambda_i &\geq 0 \quad \text{for all } i = 1, 2, \ldots, k \\
\lambda_i &= 0 \quad \text{if } g_i(x) < d_i.
\end{aligned}
$$



Comparing the Kuhn-Tucker theorem to the Lagrange multipliers in the equality constrained
optimization problem, we see that the major difference is that the
Kuhn-Tucker multipliers are nonnegative while the Lagrange multipliers can
have any sign. This additional information can occasionally be very useful.

The Kuhn-Tucker theorem only provides a necessary condition for a maximum. The
following theorem states conditions that guarantee the above first-order conditions are
sufficient.

Theorem 1.2.5 (Kuhn-Tucker Sufficiency) Suppose f is concave and each $g_i$ is convex.
If x satisfies the Kuhn-Tucker first-order conditions specified in the above theorem,
then x is a global solution to the constrained optimization problem.
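To see the Kuhn-Tucker conditions and complementary slackness at work, the following sketch solves a one-dimensional problem by case analysis: maximize f(x) = -(x - 2)^2 subject to g(x) = x <= d. The problem and the function name `solve_kt` are assumptions made for illustration. Since this f is concave and g is convex, the sufficiency theorem applies and any point satisfying the conditions is a global solution.

```python
def solve_kt(d):
    """Maximize f(x) = -(x - 2)**2 subject to g(x) = x <= d.

    Returns (x, lam), where lam is the Kuhn-Tucker multiplier.
    """
    # Case 1: the constraint is slack, so lam = 0 and f'(x) = 0 gives x = 2.
    x, lam = 2.0, 0.0
    if x <= d:
        return x, lam
    # Case 2: the constraint binds, so x = d and f'(x) = lam * g'(x),
    # i.e. lam = -2 * (d - 2), which is positive because d < 2 here.
    x = d
    lam = -2.0 * (x - 2.0)
    return x, lam


print(solve_kt(1.0))   # binding constraint, positive multiplier
print(solve_kt(3.0))   # slack constraint, zero multiplier
```

In both cases the multiplier is nonnegative, and it is zero exactly when the constraint is slack, which is the complementary slackness condition.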

We can weaken the conditions in the above theorem when there is only one constraint.
Let $C = \{x \in R^n : g(x) \leq d\}$.

Proposition 1.2.1 Suppose f is quasi-concave and the set C is convex (this is true if g
is quasi-convex). If x satisfies the Kuhn-Tucker first-order conditions, then x is a global
solution to the constrained optimization problem.

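Sometimes we require x to be nonnegative. Suppose we have the optimization problem:

$$
\begin{aligned}
\max\ & f(x) \\
\text{such that}\ & g_i(x) \leq d_i, \qquad i = 1, 2, \ldots, k \\
& x \geq 0.
\end{aligned}
$$

Then the Lagrange function in this case is given by

$$L(x, \lambda, \mu) = f(x) + \sum_{l=1}^{k} \lambda_l [d_l - g_l(x)] + \sum_{j=1}^{n} \mu_j x_j,$$

where $\mu_1, \ldots, \mu_n$ are the multipliers associated with the constraints $x_j \geq 0$. The first-order
conditions are

$$
\begin{aligned}
\frac{\partial L(x, \lambda, \mu)}{\partial x_i} = \frac{\partial f(x)}{\partial x_i} - \sum_{l=1}^{k} \lambda_l \frac{\partial g_l(x)}{\partial x_i} + \mu_i &= 0, \qquad i = 1, 2, \ldots, n \\
\lambda_l &\geq 0, \qquad l = 1, 2, \ldots, k \\
\lambda_l &= 0 \quad \text{if } g_l(x) < d_l \\
\mu_i &\geq 0, \qquad i = 1, 2, \ldots, n \\
\mu_i &= 0 \quad \text{if } x_i > 0.
\end{aligned}
$$

Eliminating $\mu_i$, we can equivalently write the above first-order conditions with nonnegative
choice variables as

$$\frac{\partial f(x)}{\partial x_i} - \sum_{l=1}^{k} \lambda_l \frac{\partial g_l(x)}{\partial x_i} \leq 0 \quad \text{with equality if } x_i > 0, \qquad i = 1, 2, \ldots, n,$$

or in matrix notation,

$$
\begin{aligned}
Df - \lambda Dg &\leq 0 \\
x[Df - \lambda Dg] &= 0,
\end{aligned}
$$

where we have written the product of two vectors x and y as the inner product, i.e.,
$xy = \sum_{i=1}^{n} x_i y_i$. Thus, if we are at an interior optimum (i.e., $x_i > 0$ for all i), we have

$$Df(x) = \lambda Dg.$$
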
1.2.5 The Envelope Theorem 

Consider an arbitrary maximization problem where the objective function depends on 
some parameter a: 

$$M(a) = \max_{x} f(x, a).$$

The function M(a) gives the maximized value of the objective function as a function of 
the parameter a. 

Let x(a) be the value of x that solves the maximization problem. Then we can also
write M(a) = f(x(a), a). It is often of interest to know how M(a) changes as a changes.
The envelope theorem tells us the answer:

$$\frac{dM(a)}{da} = \frac{\partial f(x, a)}{\partial a}\bigg|_{x = x(a)}.$$

This expression says that the derivative of M with respect to a is given by the partial
derivative of f with respect to a, holding x fixed at the optimal choice. This is the meaning
of the vertical bar to the right of the derivative. The proof of the envelope theorem is a
relatively straightforward calculation.
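The envelope formula can be verified numerically on a simple case. The sketch below assumes the illustrative objective f(x, a) = a x - x^2, whose maximizer x(a) = a / 2 follows from the first-order condition a - 2x = 0. The objective, the function names, and the step size h are assumptions made for this example.

```python
def f(x, a):
    """Illustrative objective (an assumption, not from the notes)."""
    return a * x - x ** 2


def x_opt(a):
    """Maximizer of f(., a), from the first-order condition a - 2x = 0."""
    return a / 2.0


def M(a):
    """Value function M(a) = f(x(a), a)."""
    return f(x_opt(a), a)


def dM_da(a, h=1e-6):
    """Total derivative of the value function by central differences."""
    return (M(a + h) - M(a - h)) / (2 * h)


def df_da_at_opt(a, h=1e-6):
    """Partial derivative of f in a, holding x fixed at the optimal choice."""
    x = x_opt(a)
    return (f(x, a + h) - f(x, a - h)) / (2 * h)


print(dM_da(3.0), df_da_at_opt(3.0))
```

Both numbers agree up to rounding error, even though the first lets x adjust with a and the second holds x fixed; the indirect effect through x(a) vanishes because of the first-order condition.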

Now consider a more general parameterized constrained maximization problem of the
form

$$
\begin{aligned}
M(a) = \max_{x_1, x_2}\ & g(x_1, x_2, a) \\
\text{such that}\ & h(x_1, x_2, a) = 0.
\end{aligned}
$$

The Lagrangian for this problem is

$$\mathcal{L} = g(x_1, x_2, a) - \lambda h(x_1, x_2, a),$$

and the first-order conditions are

$$
\begin{aligned}
\frac{\partial g}{\partial x_1} - \lambda \frac{\partial h}{\partial x_1} &= 0 \\
\frac{\partial g}{\partial x_2} - \lambda \frac{\partial h}{\partial x_2} &= 0 \\
h(x_1, x_2, a) &= 0.
\end{aligned}
\tag{1.1}
$$

These conditions determine the optimal choice functions $(x_1(a), x_2(a))$, which in turn
determine the maximum value function

$$M(a) \equiv g(x_1(a), x_2(a), a). \tag{1.2}$$

The envelope theorem gives us a formula for the derivative of the value function with
respect to a parameter in the maximization problem. Specifically, the formula is

$$
\frac{dM(a)}{da} = \frac{\partial \mathcal{L}(x, a)}{\partial a}\bigg|_{x = x(a)}
= \frac{\partial g(x_1, x_2, a)}{\partial a}\bigg|_{x_i = x_i(a)} - \lambda \frac{\partial h(x_1, x_2, a)}{\partial a}\bigg|_{x_i = x_i(a)}.
$$

As before, the interpretation of the partial derivatives needs special care: they are the
derivatives of g and h with respect to a holding $x_1$ and $x_2$ fixed at their optimal values.

1.2.6 Point-to-Set Mappings 

When a mapping is not a single-valued function, but is a point-to-set mapping, it is called
a correspondence, or a multi-valued function. That is, a correspondence F maps points x
in the domain $X \subseteq R^n$ into sets in the range $Y \subseteq R^m$, and it is denoted by $F : X \to 2^Y$.
We also use $F : X \rightrightarrows Y$ to denote the mapping $F : X \to 2^Y$ in these lecture notes.

Definition 1.2.7 A correspondence $F : X \to 2^Y$ is: (1) non-empty valued if the set F(x)
is non-empty for all $x \in X$; (2) convex valued if the set F(x) is a convex set for all $x \in X$;
(3) closed valued if the set F(x) is a closed set for all $x \in X$; (4) compact valued if the set F(x)
is a compact set for all $x \in X$.

Intuitively, a correspondence is continuous if small changes in x produce small changes 
in the set F(x). Unfortunately, giving a formal definition of continuity for correspondences 
is not so simple. Figure 1.1 shows a continuous correspondence. 




Figure 1.1: A continuous correspondence.

The notions of hemi-continuity are usually defined in terms of sequences (see Debreu
(1959) and Mas-Colell et al. (1995)), but, although they are relatively easy to verify, they
are not intuitive and depend on the assumption that a correspondence is compact-valued.
The following definitions are more formal (see Border, 1988).

Definition 1.2.8 A correspondence $F : X \to 2^Y$ is upper hemi-continuous at x if for
each open set U containing F(x), there is an open set N(x) containing x such that if
$x' \in N(x)$, then $F(x') \subseteq U$. A correspondence $F : X \to 2^Y$ is upper hemi-continuous if it
is upper hemi-continuous at every $x \in X$, or equivalently, if the set $\{x \in X : F(x) \subseteq V\}$
is open in X for every open subset V of Y.

Remark 1.2.6 Upper hemi-continuity captures the idea that F(x) will not "suddenly
contain new points" just as we move past some point x; in other words, F(x) does not
suddenly become much larger if one changes the argument x slightly. That is, if one
starts at a point x and moves a little way to x', upper hemi-continuity at x implies that
there will be no point in F(x') that is not close to some point in F(x).

Definition 1.2.9 A correspondence $F : X \to 2^Y$ is said to be lower hemi-continuous
at x if for every open set V with $F(x) \cap V \neq \emptyset$, there exists a neighborhood N(x) of
x such that $F(x') \cap V \neq \emptyset$ for all $x' \in N(x)$. A correspondence $F : X \to 2^Y$ is lower
hemi-continuous if it is lower hemi-continuous at every $x \in X$, or equivalently, the set
$\{x \in X : F(x) \cap V \neq \emptyset\}$ is open in X for every open subset V of Y.

Remark 1.2.7 Lower hemi-continuity captures the idea that any element in F(x) can be
"approached" from all directions; in other words, F(x) does not suddenly become much
smaller if one changes the argument x slightly. That is, if one starts at some point x and
some point $y \in F(x)$, lower hemi-continuity at x implies that if one moves a little way
from x to x', there will be some $y' \in F(x')$ that is close to y.

Remark 1.2.8 Based on the following two facts, both notions of hemi-continuity can be
characterized by sequences.

(a) If a correspondence $F : X \to 2^Y$ is compact-valued, then it is upper
hemi-continuous if and only if for any $\{x_k\}$ with $x_k \to x$ and $\{y_k\}$ with
$y_k \in F(x_k)$, there exists a converging subsequence $\{y_{k_m}\}$ of $\{y_k\}$, $y_{k_m} \to y$,
such that $y \in F(x)$.

(b) A correspondence $F : X \to 2^Y$ is lower hemi-continuous at x
if and only if for any $\{x_k\}$ with $x_k \to x$ and any $y \in F(x)$, there is a
sequence $\{y_k\}$ with $y_k \to y$ and $y_k \in F(x_k)$.

Definition 1.2.10 A correspondence $F : X \to 2^Y$ is said to be closed at x if for any $\{x_k\}$
with $x_k \to x$ and $\{y_k\}$ with $y_k \to y$ and $y_k \in F(x_k)$, we have $y \in F(x)$. F is said to be
closed if F is closed at every $x \in X$ or, equivalently, if its graph

$$Gr(F) = \{(x, y) \in X \times Y : y \in F(x)\}$$

is closed.

Remark 1.2.9 Regarding the relationship between upper hemi-continuity and closed
graph, the following facts can be proved.

(i) If Y is compact and $F : X \to 2^Y$ is closed-valued, then F having a closed graph
implies that it is upper hemi-continuous.

(ii) If X and Y are closed and $F : X \to 2^Y$ is closed-valued, then F being upper
hemi-continuous implies that it has a closed graph.

Because of fact (i), a correspondence with closed graph is sometimes called upper
hemi-continuous in the literature. But one should keep in mind that the two notions are not the
same in general. For example, let $F : R_+ \to 2^R$ be defined by



The correspondence is closed but not upper hemi-continuous. 

Also, define $F : R_+ \to 2^R$ by F(x) = (0, 1). Then F is upper hemi-continuous but not
closed.

Figure 1.2 shows a correspondence that is upper hemi-continuous, but not lower
hemi-continuous. To see why it is upper hemi-continuous, imagine an open interval U that
encompasses F(x). Now consider moving a little to the left of x to a point x'. Clearly
$F(x') = \{\hat{y}\}$ is in the interval. Similarly, if we move to a point x' a little to the right of
x, then F(x') will lie inside the interval so long as x' is sufficiently close to x. So it is upper
hemi-continuous. On the other hand, the correspondence is not lower hemi-continuous.
To see this, consider the point $y \in F(x)$, and let U be a very small interval around y that
does not include $\hat{y}$. If we take any open set N(x) containing x, then it will contain some
point x' to the left of x. But then $F(x') = \{\hat{y}\}$ will contain no points near y, i.e., it will
not intersect U.



Figure 1.2: A correspondence that is upper hemi-continuous, but not lower
hemi-continuous. Here $X = Y = [0, \infty)$.



Figure 1.3 shows a correspondence that is lower hemi-continuous, but not upper
hemi-continuous. To see why it is lower hemi-continuous, note that for any $0 \leq x' \leq x$,
$F(x') = \{\hat{y}\}$. Let $x_n = x' - 1/n$ and let $y_n = \hat{y}$. Then $x_n > 0$ for sufficiently large n,
$x_n \to x'$, $y_n \to \hat{y}$, and $y_n \in F(x_n) = \{\hat{y}\}$. So it is lower hemi-continuous at x'. It is clearly
lower hemi-continuous for $x' > x$. Thus, it is lower hemi-continuous on X. On the other
hand, the correspondence is not upper hemi-continuous. If we start at x, noting that
$F(x) = \{\hat{y}\}$, and make a small move to the right to a point x', then F(x') suddenly
contains many points that are not close to $\hat{y}$. So this correspondence fails to be upper
hemi-continuous.

Combining upper and lower hemi-continuity, we can define the continuity of a corre- 
spondence. 

Definition 1.2.11 A correspondence $F : X \to 2^Y$ is said to be continuous at $x \in X$ if it
is both upper hemi-continuous and lower hemi-continuous at x. A correspondence
$F : X \to 2^Y$ is said to be continuous if it is both upper hemi-continuous and lower
hemi-continuous.

Figure 1.3: A correspondence that is lower hemi-continuous, but not upper
hemi-continuous. Here $X = Y = [0, \infty)$.

Remark 1.2.10 As it turns out, the notions of upper and lower hemi-continuous
correspondence both reduce to the standard notion of continuity for a function if $F(\cdot)$ is a
single-valued correspondence, i.e., a function. That is, $F(\cdot)$ is a single-valued upper (or lower)
hemi-continuous correspondence if and only if it is a continuous function.

Definition 1.2.12 A correspondence $F : X \to 2^Y$ is said to be open if its graph

$$Gr(F) = \{(x, y) \in X \times Y : y \in F(x)\}$$

is open.

Definition 1.2.13 A correspondence $F : X \to 2^Y$ is said to have upper open sections if
F(x) is open for all $x \in X$.

A correspondence $F : X \to 2^Y$ is said to have lower open sections if its inverse set
$F^{-1}(y) = \{x \in X : y \in F(x)\}$ is open for all $y \in Y$.

Remark 1.2.11 If a correspondence $F : X \to 2^Y$ has an open graph, then it has upper
and lower open sections. If a correspondence $F : X \to 2^Y$ has lower open sections, then
it must be lower hemi-continuous.

1.2.7 Continuity of a Maximum 

In many places, we need to check whether an optimal solution is continuous in parameters, say,
to check the continuity of the demand function. We can apply the so-called Maximum
Theorem.

Theorem 1.2.6 (Berge's Maximum Theorem) Suppose f(x, a) is a continuous function
mapping from $A \times X$ to $R$, and the constraint set $F : A \rightrightarrows X$ is a continuous
correspondence with non-empty compact values. Then the optimal value function (also
called the marginal function):

$$M(a) = \max_{x \in F(a)} f(x, a)$$

is a continuous function, and the optimal solution:

$$\mu(a) = \arg\max_{x \in F(a)} f(x, a)$$

is an upper hemi-continuous correspondence.

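A minimal numerical sketch of the Maximum Theorem follows. It assumes the illustrative objective f(x, a) = a x with the constant constraint set F(a) = [0, 1], approximated by a finite grid; the objective, the grid, and the name `value_and_argmax` are assumptions for this example only.

```python
# Finite grid standing in for the compact constraint set F(a) = [0, 1].
GRID = [i / 100 for i in range(101)]


def value_and_argmax(a):
    """Return (M(a), list of grid maximizers) for f(x, a) = a * x on GRID."""
    values = [a * x for x in GRID]
    m = max(values)
    maximizers = [x for x, v in zip(GRID, values) if v == m]
    return m, maximizers


for a in (-1.0, 0.0, 2.0):
    m, argmax = value_and_argmax(a)
    print(a, m, len(argmax))
```

Here the value function is M(a) = max(a, 0), which is continuous, while the set of maximizers jumps from {0} for a < 0 to the whole interval at a = 0 and then to {1} for a > 0. The solution correspondence is upper hemi-continuous but not lower hemi-continuous at a = 0, which is all the theorem promises.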
1.2.8 Fixed Point Theorems 

To show the existence of a competitive equilibrium for the continuous aggregate excess
demand function, we will use the following fixed point theorem. A generalization of
Brouwer's fixed point theorem can be found in Tian (1991), which gives necessary and sufficient
conditions for a function to have a fixed point.

Theorem 1.2.7 (Brouwer's Fixed Point Theorem) Let X be a non-empty, compact, and
convex subset of $R^m$. If a function $f : X \to X$ is continuous on X, then f has a fixed
point, i.e., there is a point $x^* \in X$ such that $f(x^*) = x^*$.

Figure 1.4: Fixed points are given by the intersections of the 45° line and the curve of
the function. There are three fixed points in the case depicted.

Example 1.2.1 If $f : [0, 1] \to [0, 1]$ is continuous, then f has a fixed point. To see this,
let

$$g(x) = f(x) - x.$$

Then, we have

$$
\begin{aligned}
g(0) &= f(0) \geq 0 \\
g(1) &= f(1) - 1 \leq 0.
\end{aligned}
$$

Since g is continuous, the intermediate value theorem implies that there is a point
$x^* \in [0, 1]$ such that $g(x^*) = f(x^*) - x^* = 0$.
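The argument in Example 1.2.1 is constructive enough to turn into a bisection search on g(x) = f(x) - x. The sketch below assumes f = cos, which is continuous and maps [0, 1] into itself; the function name `fixed_point` and the tolerance are assumptions for illustration.

```python
from math import cos


def fixed_point(f, lo=0.0, hi=1.0, tol=1e-12):
    """Bisection on g(x) = f(x) - x for a continuous f mapping [lo, hi] into itself."""
    def g(x):
        return f(x) - x

    # Invariant: g(lo) >= 0 and g(hi) <= 0, as in the example.
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if g(mid) >= 0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


x_star = fixed_point(cos)
print(x_star, cos(x_star))
```

For f = cos the search settles near 0.739. Bisection works here only because the domain is one-dimensional; in higher dimensions Brouwer's theorem still guarantees existence but gives no such simple procedure.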

When a mapping is a correspondence, we have the following version of the fixed point
theorem.

Theorem 1.2.8 (Kakutani's Fixed Point Theorem) Let X be a non-empty, compact,
and convex subset of $R^m$. If a correspondence $F : X \to 2^X$ is an upper
hemi-continuous correspondence with non-empty compact and convex values on X, then F has
a fixed point, i.e., there is a point $x^* \in X$ such that $x^* \in F(x^*)$.

The Knaster-Kuratowski-Mazurkiewicz (KKM) lemma is quite basic and in some ways
more useful than Brouwer's fixed point theorem. The following is a generalized version of
the KKM lemma due to Ky Fan (1984).

Theorem 1.2.9 (FKKM Theorem) Let Y be a convex set and $\emptyset \neq X \subseteq Y$. Suppose
$F : X \to 2^Y$ is a correspondence such that

(1) F(x) is closed for all $x \in X$;

(2) $F(x_0)$ is compact for some $x_0 \in X$;

(3) F is FS-convex, i.e., for any $x_1, \ldots, x_m \in X$ and any convex combination
$x_\lambda = \sum_{i=1}^{m} \lambda_i x_i$, we have $x_\lambda \in \bigcup_{i=1}^{m} F(x_i)$.

Then $\bigcap_{x \in X} F(x) \neq \emptyset$.

Here, the term FS is for Fan (1984) and Sonnenschein (1971), who introduced the notion
of FS-convexity.

The various characterization results on Kakutani's Fixed Point Theorem, the KKM Lemma,
and the Maximum Theorem can be found in Tian (1991, 1992, 1994) and Tian and Zhou
(1992).

Part I

Individual Decision Making

Part I is devoted to the theories of individual decision making and consists of three
chapters: consumer theory, producer theory, and choice under uncertainty. It studies how
a consumer or producer selects an appropriate action or makes an appropriate decision.
Microeconomic theory is founded on the premise that these individuals behave rationally,
making choices that are optimal for themselves. Throughout this part, we restrict
ourselves to an ideal situation (benchmark case) where the behavior of others is
summarized in non-individualized parameters, namely the prices of commodities: each
individual makes decisions independently by taking prices as given, and individuals
interact only indirectly through prices.

We will treat consumer theory first, and at some length, both because of its intrinsic
importance and because its methods and results are paradigms for many other topic
areas. Producer theory is next, and we draw attention to the many formal similarities
between these two important building blocks of modern microeconomics. Finally, we
conclude our treatment of the individual consumer by looking at the problem of choice
under uncertainty. It is gratefully acknowledged that the materials are largely drawn from
the references at the end of each chapter, especially from Varian (1992).

Chapter 2
Consumer Theory



2.1 Introduction 

In this chapter, we will explore the essential features of modern consumer theory, a
bedrock foundation on which so many theoretical structures in economics are built and
which is also central to the economists' way of thinking.

A consumer can be characterized by many factors and aspects such as sex, age, lifestyle,
wealth, parentage, ability, intelligence, etc. But which are the most important ones for
studying a consumer's behavior in making choices? To grasp the most important features in
studying consumer behavior and choices in modern consumer theory, it is assumed that the
key characteristic of a consumer consists of three essential components: the consumption
set, initial endowments, and the preference relation. The consumer's characteristic together
with the behavior assumption are the building blocks in any model of consumer theory. The
consumption set represents the set of all individually feasible alternatives or consumption
plans and is sometimes also called the choice set. An initial endowment represents the
amount of various goods the consumer initially has and can consume or trade with other
individuals. The preference relation specifies the consumer's tastes or satisfactions for the
different objects of choice. The behavior assumption expresses the guiding principle the
consumer uses to make final choices and identifies the ultimate objects in choice. It is
generally assumed that the consumer seeks to identify and select an available alternative
that is most preferred in the light of his/her personal tastes/interests.






2.2 Consumption Set and Budget Constraint 
2.2.1 Consumption Set 



Figure 2.1: The left figure: A consumption set that reflects the legal limit on the number of
working hours. The right figure: the consumption set $R^2_+$.

We consider a consumer faced with possible consumption bundles in the consumption
set X. We usually assume that X is the nonnegative orthant in $R^L$ as shown in the right
figure in Figure 2.1, but more specific consumption sets may be used. For example, it may
allow consumption of some good only in a suitable interval, such as leisure as shown in the
left figure in Figure 2.1, or we might only include bundles that would give the consumer
at least a subsistence existence or that consist of only integer units of consumption as
shown in Figure 2.2.

We assume that X is a closed and convex set unless otherwise stated. The convexity
of a consumption set means that every good is divisible and can be consumed in fractional
units.

Figure 2.2: The left figure: A consumption set that reflects survival needs. The right
figure: A consumption set where good 2 must be consumed in integer amounts.

2.2.2 Budget Constraint

In the basic problem of consumer's choice, not all consumption bundles are affordable in
a limited resource economy, and a consumer is constrained by his/her wealth. In a market
institution, the wealth may be determined by the value of his/her initial endowment
and/or income from stock-holdings of firms. It is assumed that the income or wealth
of the consumer is fixed and the prices of goods cannot be affected by the consumer's
consumption when discussing a consumer's choice. Let m be the fixed amount of money
available to a consumer, and let $p = (p_1, \ldots, p_L)$ be the vector of prices of goods $1, \ldots, L$.
The set of affordable alternatives is thus just the set of all bundles that satisfy the
consumer's budget constraint. The set of affordable bundles, the budget set of the consumer,
is given by

$$B(p, m) = \{x \in X : px \leq m\},$$

where px is the inner product of the price vector and consumption bundle, i.e.,
$px = \sum_{l=1}^{L} p_l x_l$, which is the sum of expenditures on commodities at prices p. The ratio $p_l/p_k$
may be called the economic rate of substitution between goods l and k. Note that
multiplying all prices and income by some positive number does not change the budget
set.

Thus, the budget set reflects the consumer's objective ability to purchase commodities
and the scarcity of resources. It significantly restricts the consumer's choices. To determine
the optimal consumption bundles, one needs to combine the consumer's objective ability to
purchase various commodities with his/her subjective tastes over various consumption bundles,
which are characterized by the notion of preference or utility.

Figure 2.3: The left figure: A budget set. The right figure: The effect of a price change
on the budget set.
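The budget set B(p, m) and its invariance to a common rescaling of prices and income can be illustrated with a short membership test. The sketch assumes the consumption set X is the nonnegative orthant; the function names and the numbers used are hypothetical.

```python
def expenditure(p, x):
    """Inner product px: total spending on bundle x at prices p."""
    return sum(pl * xl for pl, xl in zip(p, x))


def in_budget_set(p, m, x):
    """True when x lies in B(p, m), assuming X is the nonnegative orthant."""
    nonnegative = all(xl >= 0 for xl in x)
    return nonnegative and expenditure(p, x) <= m


p, m = (2.0, 5.0), 20.0
affordable = (5.0, 2.0)      # costs exactly 20
too_expensive = (5.0, 3.0)   # costs 25

# Multiplying all prices and income by the same positive number
# leaves the membership answers unchanged.
t = 3.0
p_scaled, m_scaled = tuple(t * pl for pl in p), t * m

print(in_budget_set(p, m, affordable), in_budget_set(p_scaled, m_scaled, affordable))
print(in_budget_set(p, m, too_expensive), in_budget_set(p_scaled, m_scaled, too_expensive))
```

Only relative prices and real income matter for what is affordable, which is why the two calls in each line return the same answer.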

2.3 Preferences and Utility 
2.3.1 Preferences 

The consumer is assumed to have preferences on the consumption bundles in X so that
he can compare and rank various goods available in the economy. When we write $x \succeq y$,
we mean "the consumer thinks that the bundle x is at least as good as the bundle y." We
want the preferences to order the set of bundles. Therefore, we need to assume that they
satisfy the following standard properties.

COMPLETE. For all x and y in X, either $x \succeq y$ or $y \succeq x$ or both.

REFLEXIVE. For all x in X, $x \succeq x$.

TRANSITIVE. For all x, y and z in X, if $x \succeq y$ and $y \succeq z$, then $x \succeq z$.

The first assumption just says that any two bundles can be compared, the second
is trivial and says that every consumption bundle is as good as itself, and the third
requires the consumer's choices to be consistent. A preference relation that satisfies these
three properties is called a preference ordering.

Given an ordering $\succeq$ describing "weak preference," we can define the strict preference
$\succ$ by $x \succ y$ to mean not $y \succeq x$. We read $x \succ y$ as "x is strictly preferred to y."
Similarly, we define a notion of indifference by $x \sim y$ if and only if $x \succeq y$ and $y \succeq x$.

Given a preference ordering, we often display it graphically, as shown in Figure 2.4.
The set of all consumption bundles that are indifferent to each other is called an indifference
curve. For a two-good case, the slope of an indifference curve at a point measures the
marginal rate of substitution between goods $x_1$ and $x_2$. For an L-dimensional case, the
marginal rate of substitution between two goods is the slope of an indifference surface,
measured in a particular direction.



For a given consumption bundle y, let $P(y) = \{x \in X : x \succeq y\}$ be the set of all
bundles on or above the indifference curve through y, called the upper contour
set at y; let $P_s(y) = \{x \in X : x \succ y\}$ be the set of all bundles above the indifference curve
through y, called the strictly upper contour set at y; let $L(y) = \{x \in X : x \preceq y\}$
be the set of all bundles on or below the indifference curve through y, called the
lower contour set at y; and let $L_s(y) = \{x \in X : x \prec y\}$ be the set of all bundles
below the indifference curve through y, called the strictly lower contour set
at y.

We often wish to make other assumptions on consumers' preferences; for example:

CONTINUITY. For all y in X, the upper and lower contour sets P(y) and L(y) are
closed. It follows that the strictly upper and lower contour sets, $P_s(y)$ and $L_s(y)$, are
open sets.

This assumption is necessary to rule out certain discontinuous behavior; it says that
if $(x^i)$ is a sequence of consumption bundles that are all at least as good as a bundle y,
and if this sequence converges to some bundle $x^*$, then $x^*$ is at least as good as y. The
most important consequence of continuity is this: if y is strictly preferred to z and if x is
a bundle that is close enough to y, then x must be strictly preferred to z.




Figure 2.4: Preferences in two dimensions.

Example 2.3.1 (Lexicographic Ordering) An interesting preference ordering is the
so-called lexicographic ordering defined on $R^L$, based on the way one orders words
alphabetically. It is defined as follows: $x \succeq y$ if and only if there is an l, $1 \leq l \leq L$, such that
$x_i = y_i$ for $i < l$ and $x_l > y_l$, or if $x_i = y_i$ for all $i = 1, \ldots, L$. Essentially the lexicographic
ordering compares the components one at a time, beginning with the first, and determines
the ordering based on the first time a different component is found: the vector with the
greater component is ranked highest. However, the lexicographic ordering is not continuous,
and not even upper semi-continuous, i.e., the upper contour set is not closed. This is
easily seen for the two-dimensional case by considering the upper contour set
of y = (1, 1), that is, the set $P(1, 1) = \{x \in X : x \succeq (1, 1)\}$ as shown in Figure 2.5. It is
clearly not closed because the boundary of the set below (1, 1) is not contained in the set.

Figure 2.5: Preferred set for lexicographic ordering.
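The failure of closedness in the lexicographic example can be seen on a concrete sequence. The sketch below relies on the fact that Python compares tuples component by component, which matches the lexicographic rule; the function name and the particular sequence are assumptions made for illustration.

```python
def lex_weakly_prefers(x, y):
    """True when x is at least as good as y under the lexicographic ordering."""
    # Python tuple comparison is itself lexicographic.
    return tuple(x) >= tuple(y)


y = (1.0, 1.0)

# Bundles (1 + 1/n, 0) all lie in the upper contour set of y ...
sequence = [(1.0 + 1.0 / n, 0.0) for n in range(1, 6)]
all_in_upper_set = all(lex_weakly_prefers(x, y) for x in sequence)

# ... but their limit (1, 0) does not, so the set is not closed.
limit = (1.0, 0.0)
limit_in_upper_set = lex_weakly_prefers(limit, y)

print(all_in_upper_set, limit_in_upper_set)
```

Every term of the sequence has strictly more of good 1 than y, so it is preferred regardless of good 2; the limit ties on good 1 and loses on good 2.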

There are two more assumptions, namely, monotonicity and convexity, that are often
used to guarantee nice behavior of consumer demand functions. We first give various
types of monotonicity properties used in consumer theory.

WEAK MONOTONICITY. If $x \geq y$, then $x \succeq y$.

MONOTONICITY. If $x > y$, then $x \succ y$.

STRONG MONOTONICITY. If $x \geq y$ and $x \neq y$, then $x \succ y$.

Weak monotonicity says that "at least as much of everything is at least as good,"
which ensures a commodity is a "good", but not a "bad". Monotonicity says that strictly
more of every good is strictly better. Strong monotonicity says that at least as much of
every good, and strictly more of some good, is strictly better.

Another assumption that is weaker than either monotonicity or strong monotonicity
is the following:

LOCAL NON-SATIATION. Given any x in X and any $\epsilon > 0$, there is some
bundle y in X with $|x - y| < \epsilon$ such that $y \succ x$.

NON-SATIATION. Given any x in X, there is some bundle y in X such that
$y \succ x$.

Remark 2.3.1 Monotonicity of preferences can be interpreted as individuals' desires for
goods: the more, the better. Local non-satiation says that one can always do a little
bit better, even if one is restricted to only small changes in the consumption bundle.
Thus, local non-satiation means individuals' desires are unlimited. You should verify
that (strong) monotonicity implies local non-satiation and local non-satiation implies
non-satiation, but not vice versa.

We now give various types of convexity properties used in the consumer theory. 



STRICT CONVEXITY. Given $x \neq x'$ in X such that $x' \succeq x$, it follows that
$tx + (1 - t)x' \succ x$ for all $0 < t < 1$.

CONVEXITY. Given x, x' in X such that $x' \succ x$, it follows that $tx + (1 - t)x' \succ x$
for all $0 \leq t < 1$.

WEAK CONVEXITY. Given x, x' in X such that $x' \succeq x$, it follows that
$tx + (1 - t)x' \succeq x$ for all $0 \leq t \leq 1$.




Figure 2.6: Strictly convex indifference curves.

Figure 2.7: Linear indifference curves are convex, but not strictly convex.

Figure 2.8: "Thick" indifference curves are weakly convex, but not convex.

Remark 2.3.2 The convexity of preferences implies that people want to diversify their
consumption (the consumer prefers averages to extremes), and thus convexity can be
viewed as the formal expression of a basic inclination of economic agents toward diversification.
Note that convex preferences may have indifference curves that exhibit "flat spots," while
strictly convex preferences have indifference curves that are strictly rotund. The strict
convexity of $\succeq$ implies the neoclassical assumption of "diminishing marginal rates of
substitution" between any two goods as shown in Figure 2.9.

Figure 2.9: The marginal rate of substitution is diminishing when the consumption of
good 1 increases.

2.3.2 The Utility Function 

Sometimes it is easier to work directly with the preference relation and its associated
sets. But other times, especially when one wants to use calculus methods, it is easier to
work with preferences that can be represented by a utility function; that is, a function
$u : X \to R$ such that $x \succeq y$ if and only if $u(x) \geq u(y)$. In the following, we give some
examples of utility functions.

Example 2.3.2 (Cobb-Douglas Utility Function) A utility function that is used
frequently for illustrative and empirical purposes is the Cobb-Douglas utility function,

$$u(x_1, x_2, \ldots, x_L) = x_1^{\alpha_1} x_2^{\alpha_2} \cdots x_L^{\alpha_L}$$

with $\alpha_l > 0$, $l = 1, \ldots, L$. This utility function represents a preference ordering that is
continuous, strictly monotonic, and strictly convex in $R^L_{++}$.

Example 2.3.3 (Linear Utility Function) A utility function that describes perfect
substitution between goods is the linear utility function,

$$u(x_1, x_2, \ldots, x_L) = a_1 x_1 + a_2 x_2 + \cdots + a_L x_L$$

with $a_l \geq 0$ for all $l = 1, \ldots, L$ and $a_l > 0$ for at least one l. This utility function represents
a preference ordering that is continuous, monotonic, and convex in $R^L_+$.

Example 2.3.4 (Leontief Utility Function) A utility function that describes perfect
complementarity between goods is the Leontief utility function,

$$u(x_1, x_2, \ldots, x_L) = \min\{a_1 x_1, a_2 x_2, \ldots, a_L x_L\}$$

with $a_l \geq 0$ for all $l = 1, \ldots, L$ and $a_l > 0$ for at least one l. This represents a preference
under which all commodities must be used together in order to increase consumer utility. This
utility function represents a preference ordering that is also continuous, monotonic, and
convex in $R^L_+$.
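The three utility functions in Examples 2.3.2 to 2.3.4 are straightforward to evaluate. The sketch below is a minimal implementation in which bundles and parameter vectors are plain tuples; the function names and the sample numbers are assumptions chosen for illustration.

```python
from math import prod


def cobb_douglas(x, alpha):
    """u(x) = x_1^alpha_1 * ... * x_L^alpha_L."""
    return prod(xl ** al for xl, al in zip(x, alpha))


def linear(x, a):
    """u(x) = a_1 x_1 + ... + a_L x_L (perfect substitutes)."""
    return sum(al * xl for al, xl in zip(a, x))


def leontief(x, a):
    """u(x) = min{a_1 x_1, ..., a_L x_L} (perfect complements)."""
    return min(al * xl for al, xl in zip(a, x))


x = (4.0, 9.0)
print(cobb_douglas(x, (0.5, 0.5)))
print(linear(x, (1.0, 2.0)))
print(leontief(x, (1.0, 2.0)))

# Under Leontief preferences, extra units of good 2 alone add nothing.
print(leontief((4.0, 100.0), (1.0, 2.0)))
```

The last line shows the sense in which Leontief preferences are monotonic but not strongly monotonic: more of one good alone need not make the consumer strictly better off.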

Not all preference orderings can be represented by utility functions, but it can be shown
that any (upper semi-) continuous preference ordering can be represented by a (upper
semi-) continuous utility function. We now prove a weaker version of this assertion. The
following theorem shows the existence of a utility function when a preference ordering
is continuous and strongly monotonic.

Theorem 2.3.1 (Existence of a Utility Function) Suppose preferences are complete,
reflexive, transitive, continuous, and strongly monotonic. Then there exists a continuous
utility function $u : R^L_+ \to R$ which represents those preferences.

Proof. Let e be the vector in $R^L_+$ consisting of all ones. Then given any vector x let
u(x) be that number such that $x \sim u(x)e$. We have to show that such a number exists
and is unique.

Let $B = \{t \in R : te \succeq x\}$ and $W = \{t \in R : x \succeq te\}$. Then strong monotonicity
implies B is nonempty; W is certainly nonempty since it contains 0. Continuity implies
both sets are closed. Since the real line is connected, there is some $t_x$ such that $t_x e \sim x$.
We have to show that this utility function actually represents the underlying preferences.
Let

$$
\begin{aligned}
u(x) &= t_x \quad \text{where } t_x e \sim x \\
u(y) &= t_y \quad \text{where } t_y e \sim y.
\end{aligned}
$$

Then if $t_x < t_y$, strong monotonicity shows that $t_x e \prec t_y e$, and transitivity shows that

$$x \sim t_x e \prec t_y e \sim y.$$

Similarly, if $x \succ y$, then $t_x e \succ t_y e$ so that $t_x$ must be greater than $t_y$.

Finally, we show that the function u defined above is continuous. Suppose $\{x_k\}$ is a
sequence with $x_k \to x$. We want to show that $u(x_k) \to u(x)$. Suppose not. Then we can
find $\epsilon > 0$ and an infinite number of k's such that $u(x_k) > u(x) + \epsilon$ or an infinite set of
k's such that $u(x_k) < u(x) - \epsilon$. Without loss of generality, let us assume the first of these.
This means that $x_k \sim u(x_k)e \succ (u(x) + \epsilon)e \sim x + \epsilon e$. So by transitivity, $x_k \succ x + \epsilon e$. But
for a large k in our infinite set, $x + \epsilon e > x_k$, so $x + \epsilon e \succ x_k$, a contradiction. Thus u must
be continuous.

The following is an example of the non-existence of a utility function when the preference
ordering is not continuous.

Example 2.3.5 (Non-Representation of Lexicographic Ordering by a Function)

Given an upper semi-continuous utility function $u$, the upper contour set $\{x \in X : u(x) \geq \bar{u}\}$
must be closed for each value of $\bar{u}$. It follows that the lexicographic ordering defined
on $\mathbb{R}^L$ discussed earlier cannot be represented by an upper semi-continuous utility function
because its upper contour sets are not closed.

The role of the utility function is to efficiently record the underlying preference or- 
dering. The actual numerical values of u have essentially no meaning: only the sign of 
the difference in the value of u between two points is significant. Thus, a utility function 
is often a very convenient way to describe preferences, but it should not be given any 
psychological interpretation. The only relevant feature of a utility function is its ordinal 
character. Specifically, we can show that a utility function is unique only to within an 
arbitrary, strictly increasing transformation. 

Theorem 2.3.2 (Invariance of Utility Function to Monotonic Transforms) If $u(x)$
represents some preferences $\succeq$ and $f: \mathbb{R} \to \mathbb{R}$ is strictly monotonic increasing, then
$f(u(x))$ will represent exactly the same preferences.

Proof. This is because $f(u(x)) \geq f(u(y))$ if and only if $u(x) \geq u(y)$.
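The invariance result can be illustrated numerically. The following is a minimal sketch, assuming a two-good Cobb-Douglas utility with exponents 0.3 and 0.7 and the natural logarithm as the strictly increasing transform; the bundles and the helper names (`u`, `f`, `sign`) are hypothetical choices made only for illustration.

```python
import math
from itertools import combinations

def u(x1, x2):
    """Cobb-Douglas utility with illustrative exponents (an assumption)."""
    return x1 ** 0.3 * x2 ** 0.7

def f(t):
    """A strictly increasing transform on (0, inf): the natural logarithm."""
    return math.log(t)

def sign(z):
    """Return -1, 0, or 1 according to the sign of z."""
    return (z > 0) - (z < 0)

# Hypothetical bundles used only to compare rankings.
bundles = [(1.0, 4.0), (2.0, 2.0), (4.0, 1.0), (3.0, 3.0), (0.5, 8.0)]

# u and f(u) should order every pair of bundles in the same way.
same_ranking = all(
    sign(u(*x) - u(*y)) == sign(f(u(*x)) - f(u(*y)))
    for x, y in combinations(bundles, 2)
)
print(same_ranking)
```

Only the ordering of bundles is compared, never the utility levels themselves, which is the ordinal content of the theorem.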

This invariance theorem is useful in many aspects. For instance, as it will be shown, 
we may use it to simplify the computation of deriving a demand function from utility 
maximization. 

We can also use the utility function to find the marginal rate of substitution between
goods. Let $u(x_1, \ldots, x_k)$ be a utility function. Suppose that we increase the amount of
good $i$; how does the consumer have to change his consumption of good $j$ in order to keep
utility constant?

Let $dx_i$ and $dx_j$ be the differentials of $x_i$ and $x_j$. By assumption, the change in utility
must be zero, so

$$\frac{\partial u(x)}{\partial x_i} dx_i + \frac{\partial u(x)}{\partial x_j} dx_j = 0.$$

Hence

$$\frac{dx_j}{dx_i} = -\frac{\partial u(x)/\partial x_i}{\partial u(x)/\partial x_j} \equiv -\frac{MU_{x_i}}{MU_{x_j}},$$

which gives the marginal rate of substitution between goods $i$ and $j$ and is defined as the
ratio of the marginal utility of $x_i$ and the marginal utility of $x_j$.

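Remark 2.3.3 The marginal rate of substitution does not depend on the utility function
chosen to represent the underlying preferences. To prove this, let $v(u)$ be a monotonic
transformation of utility. The marginal rate of substitution for this utility function is

$$\frac{dx_j}{dx_i} = -\frac{v'(u)\,\partial u(x)/\partial x_i}{v'(u)\,\partial u(x)/\partial x_j} = -\frac{\partial u(x)/\partial x_i}{\partial u(x)/\partial x_j}.$$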
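The cancellation of $v'(u)$ can also be checked numerically. This is a rough sketch under stated assumptions: the utility is a Cobb-Douglas function with exponents 0.3 and 0.7, the transform is the logarithm, and marginal utilities are approximated by central finite differences. The function names and the evaluation point are hypothetical.

```python
import math

def mrs(util, x1, x2, h=1e-5):
    """MRS dx2/dx1 = -(du/dx1)/(du/dx2), using central finite differences."""
    mu1 = (util(x1 + h, x2) - util(x1 - h, x2)) / (2 * h)
    mu2 = (util(x1, x2 + h) - util(x1, x2 - h)) / (2 * h)
    return -mu1 / mu2

def u(x1, x2):
    """Illustrative Cobb-Douglas utility (an assumption)."""
    return x1 ** 0.3 * x2 ** 0.7

def v(x1, x2):
    """A monotonic transform of u: v = ln(u)."""
    return math.log(u(x1, x2))

mrs_u = mrs(u, 2.0, 5.0)
mrs_v = mrs(v, 2.0, 5.0)
# Closed form for this particular u: -(0.3 * x2) / (0.7 * x1).
exact = -(0.3 * 5.0) / (0.7 * 2.0)
print(mrs_u, mrs_v, exact)
```

The two numerical values agree with each other and with the closed form up to finite-difference error, even though the marginal utilities of `u` and `v` differ.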

The important properties of a preference ordering can be easily verified by examining
its utility function. The properties are summarized in the following proposition.

Proposition 2.3.1 Let $\succeq$ be represented by a utility function $u : X \to \mathbb{R}$. Then:

(1) An ordering is strictly monotonic if and only if $u$ is strictly monotonic.

(2) An ordering is continuous if and only if $u$ is continuous.

(3) An ordering is weakly convex if and only if $u$ is quasi-concave.

(4) An ordering is strictly convex if and only if $u$ is strictly quasi-concave.
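Note that a function $u$ is quasi-concave if for any $c$, $u(x) \geq c$ and $u(y) \geq c$ imply
that $u(tx + (1-t)y) \geq c$ for all $t$ with $0 < t < 1$. A function $u$ is strictly quasi-concave if
for any $c$ and any $x \neq y$, $u(x) \geq c$ and $u(y) \geq c$ imply that $u(tx + (1-t)y) > c$ for all $t$ with $0 < t < 1$.

Remark 2.3.4 The strict quasi-concavity of $u(x)$ can be checked by verifying if the
naturally ordered principal minors of the bordered Hessian alternate in sign, i.e.,

$$\begin{vmatrix} 0 & u_1 & u_2 \\ u_1 & u_{11} & u_{12} \\ u_2 & u_{21} & u_{22} \end{vmatrix} > 0, \qquad \begin{vmatrix} 0 & u_1 & u_2 & u_3 \\ u_1 & u_{11} & u_{12} & u_{13} \\ u_2 & u_{21} & u_{22} & u_{23} \\ u_3 & u_{31} & u_{32} & u_{33} \end{vmatrix} < 0,$$

and so on, where $u_i = \partial u / \partial x_i$ and $u_{ij} = \partial^2 u / \partial x_i \partial x_j$.

Example 2.3.6 Suppose the preference ordering is represented by the Cobb-Douglas
utility function: $u(x_1, x_2) = x_1^{\alpha} x_2^{\beta}$ with $\alpha > 0$ and $\beta > 0$. Then, we have

$$u_1 = \alpha x_1^{\alpha-1} x_2^{\beta}, \quad u_2 = \beta x_1^{\alpha} x_2^{\beta-1}, \quad u_{11} = \alpha(\alpha-1) x_1^{\alpha-2} x_2^{\beta}, \quad u_{12} = \alpha\beta x_1^{\alpha-1} x_2^{\beta-1}, \quad u_{22} = \beta(\beta-1) x_1^{\alpha} x_2^{\beta-2},$$

and thus

$$\begin{vmatrix} 0 & u_1 & u_2 \\ u_1 & u_{11} & u_{12} \\ u_2 & u_{21} & u_{22} \end{vmatrix} = 2u_1 u_2 u_{12} - u_1^2 u_{22} - u_2^2 u_{11} = \alpha\beta(\alpha+\beta)\, x_1^{3\alpha-2} x_2^{3\beta-2} > 0 \quad \text{for all } (x_1, x_2) > 0,$$

which means $u$ is strictly quasi-concave in $\mathbb{R}^2_{++}$.

2.4 Utility Maximization and Optimal Choice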



2.4.1 Consumer Behavior: Utility Maximization 

A foundational hypothesis on individual behavior in modern economics in general and the 
consumer theory in particular is that a rational agent will always choose a most preferred 
bundle from the set of affordable alternatives. We will derive demand functions by con- 
sidering a model of utility-maximizing behavior coupled with a description of underlying 
economic constraints. 

2.4.2 Consumer's Optimal Choice 

In the basic problem of preference maximization, the set of affordable alternatives is just
the set of all bundles that satisfy the consumer's budget constraint we discussed before.
That is, the problem of preference maximization can be written as:

$$\max \ u(x)$$
$$\text{such that } px \leq m, \quad x \in X.$$

A solution to this problem exists if the utility function is continuous and
the constraint set is closed and bounded. The constraint set is certainly closed. If
$p_i > 0$ for $i = 1, \ldots, k$ and $m > 0$, it is not difficult to show that the constraint set will
be bounded. If some price is zero, the consumer might want an infinite amount of the
corresponding good.

Proposition 2.4.1 Under the local nonsatiation assumption, a utility-maximizing bundle
$x^*$ must meet the budget constraint with equality.

Proof. Suppose we get an $x^*$ where $px^* < m$. Since $x^*$ costs strictly less than $m$, every
bundle in $X$ close enough to $x^*$ also costs less than $m$ and is therefore feasible. But,
according to the local nonsatiation hypothesis, there must be some bundle $x$ which is
close to $x^*$ and which is preferred to $x^*$. But this means that $x^*$ could not maximize
preferences on the budget set $B(p, m)$.

This proposition allows us to restate the consumer's problem as

$$\max \ u(x)$$
$$\text{such that } px = m.$$

The value of $x$ that solves this problem is the consumer's demanded bundle: it
expresses how much of each good the consumer desires at a given level of prices and
income. In general, the optimal consumption is not unique. Denote by $x(p, m)$ the set
of all utility-maximizing consumption bundles; it is called the consumer's demand
correspondence. When there is a unique demanded bundle for each $(p, m)$, $x(p, m)$
becomes a function and thus is called the consumer's demand function. We will see from
the following proposition that strict convexity of preferences ensures the uniqueness
of the optimal bundle.

Proposition 2.4.2 (Uniqueness of Demanded Bundle) If preferences are strictly convex,
then for each $p > 0$ there is a unique bundle $x$ that maximizes $u$ on the consumer's
budget set, $B(p, m)$.

Proof. Suppose $x'$ and $x''$, with $x' \neq x''$, both maximize $u$ on $B(p, m)$. Then $\frac{1}{2}x' + \frac{1}{2}x''$ is also in
$B(p, m)$ and is strictly preferred to $x'$ and $x''$, which is a contradiction.

Multiplying all prices and income by some positive number does not change the
budget set at all and thus cannot change the answer to the utility maximization problem.

Proposition 2.4.3 (Homogeneity of Demand Function) The consumer's demand function
$x(p, m)$ is homogeneous of degree 0 in $(p, m) > 0$, i.e., $x(tp, tm) = x(p, m)$ for all $t > 0$.

Note that a function $f(x)$ is homogeneous of degree $k$ if $f(tx) = t^k f(x)$ for all
$t > 0$.

2.4.3 Consumer's First-Order Conditions

We can characterize optimizing behavior by calculus, as long as the utility function is
differentiable. We will analyze this constrained maximization problem using the method
of Lagrange multipliers. The Lagrangian for the utility maximization problem can be
written as

$$\mathcal{L} = u(x) - \lambda(px - m),$$

where $\lambda$ is the Lagrange multiplier. Suppose preference is locally non-satiated. Differentiating
the Lagrangian with respect to $x_i$ gives us the first-order conditions for the interior
solution

$$\frac{\partial u(x)}{\partial x_i} - \lambda p_i = 0 \quad \text{for } i = 1, \ldots, L \qquad (2.1)$$
$$px = m \qquad (2.2)$$

Using vector notation, we can also write equation (2.1) as

$$Du(x) = \lambda p.$$

Here

$$Du(x) = \left( \frac{\partial u(x)}{\partial x_1}, \ldots, \frac{\partial u(x)}{\partial x_L} \right)$$

is the gradient of $u$: the vector of partial derivatives of $u$ with respect to each of its
arguments.

In order to interpret these conditions we can divide the $i$th first-order condition by the
$j$th first-order condition to eliminate the Lagrange multiplier. This gives us

$$\frac{\partial u(x^*)/\partial x_i}{\partial u(x^*)/\partial x_j} = \frac{p_i}{p_j}, \quad i, j = 1, \ldots, L. \qquad (2.3)$$

The fraction on the left is the marginal rate of substitution between goods $i$ and $j$,
and the fraction on the right is the economic rate of substitution between goods $i$ and $j$.
Maximization implies that these two rates of substitution should be equal. Suppose they
were not; for example, suppose

$$\frac{\partial u(x^*)/\partial x_i}{\partial u(x^*)/\partial x_j} = \frac{1}{1} \neq \frac{2}{1} = \frac{p_i}{p_j}. \qquad (2.4)$$

Then, if the consumer gives up one unit of good $i$ and purchases one unit of good $j$,
he or she will remain on the same indifference curve and have an extra dollar to spend.
Hence, total utility can be increased, contradicting maximization.

Figure 2.10 illustrates the argument geometrically. The budget line of the consumer
is given by $\{x : p_1 x_1 + p_2 x_2 = m\}$. This can also be written as the graph of an implicit
function: $x_2 = m/p_2 - (p_1/p_2)x_1$. Hence, the budget line has slope $-p_1/p_2$ and vertical
intercept $m/p_2$. The consumer wants to find the point on this budget line that achieves
highest utility. This must clearly satisfy the tangency condition that the slope of the
indifference curve equals the slope of the budget line so that the marginal rate of substitution
of $x_1$ for $x_2$ equals the economic rate of substitution of $x_1$ for $x_2$.

Figure 2.10: Preference maximization (horizontal axis: good 1; vertical axis: good 2). The
optimal consumption bundle will be at a point where an indifference curve is tangent to the
budget constraint.



Remark 2.4.1 The calculus conditions derived above make sense only when the choice
variables can be varied in an open neighborhood of the optimal choice and the budget
constraint is binding. In many economic problems the variables are naturally nonnegative.
If some variables have a value of zero at the optimal choice, the calculus conditions
described above may be inappropriate. The necessary modifications of the conditions to
handle boundary solutions are not difficult to state. The relevant first-order conditions
are given by means of the so-called Kuhn-Tucker conditions:

$$\frac{\partial u(x)}{\partial x_i} - \lambda p_i \leq 0 \quad \text{with equality if } x_i > 0, \quad i = 1, \ldots, L \qquad (2.5)$$
$$px \leq m \quad \text{with equality if } \lambda > 0 \qquad (2.6)$$

Thus the marginal utility from increasing $x_i$ must be less than or equal to $\lambda p_i$, otherwise
the consumer would increase $x_i$. If $x_i = 0$, the marginal utility from increasing $x_i$ may be
less than $\lambda p_i$, which is to say, the consumer would like to decrease $x_i$. But since $x_i$ is
already zero, this is impossible. Finally, if $x_i > 0$ so that the nonnegativity constraint is
not binding, we will have the usual conditions for an interior solution.

2.4.4 Sufficiency of Consumer's First-Order Conditions

The above first-order conditions are merely necessary conditions for a local optimum. 
However, for the particular problem at hand, these necessary first-order conditions are 
in fact sufficient for a global optimum when a utility function is quasi-concave. We then 
have the following proposition. 

Proposition 2.4.4 Suppose that $u(x)$ is differentiable and quasi-concave on $\mathbb{R}^L_+$ and
$(p, m) > 0$. If $(x, \lambda)$ satisfies the first-order conditions given in (2.5) and (2.6), then
$x$ solves the consumer's utility maximization problem at prices $p$ and income $m$.

Proof. Since the budget set $B(p, m)$ is convex and $u(x)$ is differentiable and quasi-concave
on $\mathbb{R}^L_+$, by Proposition 1.2.1, we know $x$ solves the consumer's utility maximization
problem at prices $p$ and income $m$.

With the sufficient conditions in hand, it is enough to find a solution $(x, \lambda)$ that satisfies
the first-order conditions (2.5) and (2.6). The conditions can typically be used to solve for
the demand functions $x_i(p, m)$, as we show in the following examples.

Example 2.4.1 Suppose the preference ordering is represented by the Cobb-Douglas
utility function: $u(x_1, x_2) = x_1^{a} x_2^{1-a}$ with $0 < a < 1$, which is strictly quasi-concave on $\mathbb{R}^2_{++}$. Since any
monotonic transform of this function represents the same preferences, we can also write
$u(x_1, x_2) = a \ln x_1 + (1 - a) \ln x_2$.

The demand functions can be derived by solving the following problem:

$$\max \ a \ln x_1 + (1 - a) \ln x_2$$
$$\text{such that } p_1 x_1 + p_2 x_2 = m.$$

The first-order conditions are

$$\frac{a}{x_1} - \lambda p_1 = 0, \qquad \frac{1-a}{x_2} - \lambda p_2 = 0,$$

or

$$\frac{a}{p_1 x_1} = \frac{1-a}{p_2 x_2}.$$

Cross multiply and use the budget constraint to get

$$a p_2 x_2 = p_1 x_1 - a p_1 x_1$$
$$a m = p_1 x_1$$
$$x_1(p_1, p_2, m) = \frac{am}{p_1}.$$

Substitute into the budget constraint to get the demand function for the second commodity:

$$x_2(p_1, p_2, m) = \frac{(1-a)m}{p_2}.$$
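As a sanity check on these demand functions, the following sketch evaluates them at hypothetical prices and income, verifies that the budget is exhausted and that scaling prices and income together leaves demand unchanged, and compares the resulting utility with a coarse grid of other bundles on the budget line. All numbers and function names are assumptions made for illustration.

```python
import math

def cobb_douglas_demand(p1, p2, m, a):
    """Closed-form demand for u = a*ln(x1) + (1-a)*ln(x2), with 0 < a < 1."""
    return a * m / p1, (1 - a) * m / p2

def utility(x1, x2, a):
    """Log form of the Cobb-Douglas utility."""
    return a * math.log(x1) + (1 - a) * math.log(x2)

# Illustrative numbers (assumptions, not taken from the text).
p1, p2, m, a = 2.0, 5.0, 100.0, 0.4
x1, x2 = cobb_douglas_demand(p1, p2, m, a)

# Homogeneity of degree 0: tripling prices and income leaves demand unchanged.
y1, y2 = cobb_douglas_demand(3 * p1, 3 * p2, 3 * m, a)

# Coarse grid of other bundles on the budget line; a sanity check, not a proof.
grid = [(k * 0.5, (m - p1 * k * 0.5) / p2) for k in range(1, 100)]
best_on_grid = max(utility(g1, g2, a) for g1, g2 in grid)

print(x1, x2)
print(best_on_grid <= utility(x1, x2, a) + 1e-12)
```

Each good receives a constant share of income, $a$ and $1-a$ respectively, which is why own price and income are the only arguments that matter for each demand.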

Example 2.4.2 Suppose the preference ordering is represented by the Leontief utility
function: $u(x_1, x_2) = \min\{a x_1, b x_2\}$. Since the Leontief utility function is not
differentiable, the maximum must be found by a direct argument. Assume $p > 0$.

The optimal solution must be at the kink point of the indifference curve. That is,

$$a x_1 = b x_2.$$

Substituting $x_1 = \frac{b}{a} x_2$ into the budget constraint $px = m$, we have

$$p_1 \frac{b}{a} x_2 + p_2 x_2 = m$$

and thus the demand functions are given by

$$x_2(p_1, p_2, m) = \frac{am}{b p_1 + a p_2}$$

and

$$x_1(p_1, p_2, m) = \frac{bm}{b p_1 + a p_2}.$$
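The kink-point argument can be checked with a few lines of code. This sketch assumes $a, b > 0$ and uses hypothetical prices and income; it verifies that the demanded bundle sits at the kink, exhausts the budget, and is not beaten by any bundle on a coarse grid along the budget line.

```python
def leontief_demand(p1, p2, m, a, b):
    """Demand for u = min(a*x1, b*x2), assuming a, b > 0 and positive prices."""
    denom = b * p1 + a * p2
    return b * m / denom, a * m / denom

def utility(x1, x2, a, b):
    """Leontief utility."""
    return min(a * x1, b * x2)

# Illustrative numbers (assumptions, not taken from the text).
p1, p2, m, a, b = 2.0, 3.0, 60.0, 1.0, 2.0
x1, x2 = leontief_demand(p1, p2, m, a, b)

# Other bundles on the budget line should not do better (coarse grid check).
grid = [(k * 0.25, (m - p1 * k * 0.25) / p2) for k in range(0, 121)]
best_on_grid = max(utility(g1, g2, a, b) for g1, g2 in grid)

print(x1, x2, utility(x1, x2, a, b))
```

Because the goods are consumed in the fixed proportion $x_1/x_2 = b/a$, a price change moves the bundle along the same ray through the origin rather than causing substitution.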

Example 2.4.3 Now suppose the preference ordering is represented by the linear utility
function:

$$u(x, y) = ax + by.$$

Since the marginal rate of substitution, $a/b$, and the economic rate of substitution,
$p_x/p_y$, are both constant, they cannot in general be equal. So the first-order condition
cannot hold with equality as long as $a/b \neq p_x/p_y$. In this case the answer to the
utility-maximization problem typically involves a boundary solution: only one of the two goods
will be consumed. It is worthwhile presenting a more formal solution since it serves as
a nice example of the Kuhn-Tucker theorem in action. The Kuhn-Tucker theorem is the
appropriate tool to use here, since we will almost never have an interior solution.
The Lagrange function is

$$L(x, y, \lambda) = ax + by + \lambda(m - p_x x - p_y y)$$

and thus

$$\frac{\partial L}{\partial x} = a - \lambda p_x \qquad (2.7)$$
$$\frac{\partial L}{\partial y} = b - \lambda p_y \qquad (2.8)$$
$$\frac{\partial L}{\partial \lambda} = m - p_x x - p_y y \qquad (2.9)$$

There are four cases to be considered:

Case 1. $x > 0$ and $y > 0$. Then we have $\frac{\partial L}{\partial x} = 0$ and $\frac{\partial L}{\partial y} = 0$. Thus, $\frac{a}{b} = \frac{p_x}{p_y}$. Since
$\lambda = \frac{a}{p_x} > 0$, we have $p_x x + p_y y = m$ and thus all $x$ and $y$ that satisfy $p_x x + p_y y = m$ are
the optimal consumptions.

Case 2. $x > 0$ and $y = 0$. Then we have $\frac{\partial L}{\partial x} = 0$ and $\frac{\partial L}{\partial y} \leq 0$. Thus, $\frac{a}{b} \geq \frac{p_x}{p_y}$. Since
$\lambda = \frac{a}{p_x} > 0$, we have $p_x x = m$ and thus $x = \frac{m}{p_x}$ is the optimal consumption.

Case 3. $x = 0$ and $y > 0$. Then we have $\frac{\partial L}{\partial x} \leq 0$ and $\frac{\partial L}{\partial y} = 0$. Thus, $\frac{a}{b} \leq \frac{p_x}{p_y}$. Since
$\lambda = \frac{b}{p_y} > 0$, we have $p_y y = m$ and thus $y = \frac{m}{p_y}$ is the optimal consumption.

Case 4. $x = 0$ and $y = 0$. Then we have $\frac{\partial L}{\partial x} \leq 0$ and $\frac{\partial L}{\partial y} \leq 0$. Since $\lambda \geq \frac{b}{p_y} > 0$, we
have $p_x x + p_y y = m$ and thus $m = 0$ because $x = 0$ and $y = 0$. This case therefore cannot
arise when $m > 0$.

In summary, the demand functions are given by

$$(x(p_x, p_y, m), y(p_x, p_y, m)) = \begin{cases} (m/p_x, \, 0) & \text{if } a/b > p_x/p_y \\ (0, \, m/p_y) & \text{if } a/b < p_x/p_y \\ (x, \, m/p_y - (p_x/p_y)x) & \text{if } a/b = p_x/p_y \end{cases}$$

for all $x \in [0, m/p_x]$.
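The case analysis translates directly into a small function. The sketch below assumes $a, b, p_x, p_y, m > 0$; in the knife-edge case $a/b = p_x/p_y$, where every bundle on the budget line is optimal, it returns the corner $(m/p_x, 0)$ as an arbitrary representative. The function name and test values are hypothetical.

```python
def linear_demand(px, py, m, a, b):
    """One utility-maximizing bundle for u = a*x + b*y (all arguments > 0).

    When a/b == px/py every bundle on the budget line is optimal; this
    sketch then returns the corner (m/px, 0) as an arbitrary representative.
    """
    # a*py >= b*px is a/b >= px/py written without division.
    if a * py >= b * px:
        return m / px, 0.0
    return 0.0, m / py

# Good x gives more utility per dollar: spend everything on x.
print(linear_demand(1.0, 2.0, 10.0, 1.0, 1.0))
# Good y gives more utility per dollar: spend everything on y.
print(linear_demand(2.0, 1.0, 10.0, 1.0, 1.0))
```

Comparing $a/p_x$ with $b/p_y$ (utility per dollar) is equivalent to comparing $a/b$ with $p_x/p_y$, which is why the condition can be written without dividing.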



Remark 2.4.2 In fact, the optimal solutions are easily found by comparing the relative
steepness of the indifference curves and the budget line. For instance, as shown in Figure
2.11 below, when $a/b > p_x/p_y$, the indifference curves are steeper than the budget line, and thus the
optimal solution is the one where the consumer spends all his income on good $x$. When $a/b < p_x/p_y$,
the indifference curves are flatter than the budget line, and thus the optimal solution is the one where the
consumer spends all his income on good $y$. When $a/b = p_x/p_y$, the indifference curves and
the budget line are parallel and coincide at the optimal solutions, and thus the optimal
solutions are given by all the points on the budget line.

Figure 2.11: Maximize utility and minimize expenditure are normally equivalent.



2.5 Indirect Utility, Expenditure, and Money
Metric Utility Functions

2.5.1 The Indirect Utility Function 

The ordinary utility function, $u(x)$, is defined over the consumption set $X$ and is therefore
referred to as the direct utility function. Given prices $p$ and income $m$, the consumer chooses
a utility-maximizing bundle $x(p, m)$. The level of utility achieved when $x(p, m)$ is chosen
will thus be the highest level permitted by the consumer's budget constraint facing $p$ and
$m$, and can be denoted by

$$v(p, m) = \max \ u(x)$$
$$\text{such that } px = m.$$

The function $v(p, m)$ that gives us the maximum utility achievable at given prices and
income is called the indirect utility function and thus it is a composite function of $u(\cdot)$
and $x(p, m)$, i.e.,

$$v(p, m) = u(x(p, m)). \qquad (2.11)$$

The properties of the indirect utility function are summarized in the following proposition.

Proposition 2.5.1 (Properties of the indirect utility function) If $u(x)$ is continuous
and monotonic on $\mathbb{R}^L_+$ and $(p, m) > 0$, the indirect utility function has the following
properties:

(1) $v(p, m)$ is nonincreasing in $p$; that is, if $p' \geq p$, $v(p', m) \leq v(p, m)$. Similarly,
$v(p, m)$ is nondecreasing in $m$.

(2) $v(p, m)$ is homogeneous of degree 0 in $(p, m)$.

(3) $v(p, m)$ is quasiconvex in $p$; that is, $\{p : v(p, m) \leq k\}$ is a convex set for all $k$.

(4) $v(p, m)$ is continuous at all $p \gg 0$, $m > 0$.

Proof.

(1) Let $B = \{x : px \leq m\}$ and $B' = \{x : p'x \leq m\}$ for $p' \geq p$. Then $B'$ is contained in
$B$. Hence, the maximum of $u(x)$ over $B$ is at least as big as the maximum of $u(x)$
over $B'$. The argument for $m$ is similar.

(2) If prices and income are both multiplied by a positive number, the budget set doesn't
change at all. Thus, $v(tp, tm) = v(p, m)$ for $t > 0$.

(3) Suppose $p$ and $p'$ are such that $v(p, m) \leq k$, $v(p', m) \leq k$. Let $p'' = tp + (1 - t)p'$
with $0 \leq t \leq 1$. We want to show that $v(p'', m) \leq k$. Define the budget sets:

$B = \{x : px \leq m\}$
$B' = \{x : p'x \leq m\}$
$B'' = \{x : p''x \leq m\}$

We will show that any $x$ in $B''$ must be in either $B$ or $B'$; that is, that $B \cup B' \supset B''$.
Suppose not. We then must have $px > m$ and $p'x > m$. Multiplying the
first inequality by $t$ and the second by $(1 - t)$ and then summing, we find that
$tpx + (1 - t)p'x > m$, which contradicts our original assumption. Since $B''$ is
contained in $B \cup B'$, the maximum of $u(x)$ over $B''$ is at most as big as the maximum
of $u(x)$ over $B \cup B'$, and thus $v(p'', m) \leq k$ by noting that $v(p, m) \leq k$ and
$v(p', m) \leq k$.

(4) This follows from the Maximum Theorem.



Example 2.5.1 (The General Cobb-Douglas Utility Function) Suppose a preference
ordering is represented by the Cobb-Douglas utility function given by:

$$u(x) = \prod_{l=1}^{L} (x_l)^{\alpha_l}, \quad \alpha_l > 0, \ l = 1, 2, \ldots, L.$$

Since any monotonic transform of this function represents the same preferences, we can
also write

$$u(x) = \prod_{l=1}^{L} (x_l)^{\alpha_l/\alpha}, \quad \alpha_l > 0, \ l = 1, 2, \ldots, L,$$

where $\alpha = \sum_{l=1}^{L} \alpha_l$. Let $a_l = \alpha_l/\alpha$. Then it reduces to the Cobb-Douglas utility
examined before and thus the demand functions are given by

$$x_l(p, m) = \frac{a_l m}{p_l} = \frac{\alpha_l m}{\alpha p_l}, \quad l = 1, 2, \ldots, L.$$

Substitute into the objective function and eliminate constants to get the indirect utility
function:

$$v(p, m) = \prod_{l=1}^{L} \left( \frac{\alpha_l m}{\alpha p_l} \right)^{\alpha_l}.$$

The above example also shows that a monotonic transformation is sometimes very useful
for simplifying the computation of solutions.
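The demand and indirect utility formulas of this example can be coded directly, which makes it easy to spot-check some of the properties listed in Proposition 2.5.1. The following is a sketch with hypothetical exponents, prices, and income; the function names are illustrative only.

```python
import math

def cd_demand(p, m, alpha):
    """x_l = alpha_l * m / (alpha_sum * p_l) for each good l."""
    total = sum(alpha)
    return [a_l * m / (total * p_l) for a_l, p_l in zip(alpha, p)]

def cd_utility(x, alpha):
    """u(x) = product over goods of x_l ** alpha_l."""
    return math.prod(x_l ** a_l for x_l, a_l in zip(x, alpha))

def cd_indirect_utility(p, m, alpha):
    """v(p, m) = u(x(p, m)): utility evaluated at the demanded bundle."""
    return cd_utility(cd_demand(p, m, alpha), alpha)

# Illustrative parameters (assumptions, not taken from the text).
alpha = [1.0, 2.0, 0.5]
p, m = [2.0, 1.0, 4.0], 70.0

v0 = cd_indirect_utility(p, m, alpha)
v_scaled = cd_indirect_utility([2 * p_l for p_l in p], 2 * m, alpha)
v_higher_price = cd_indirect_utility([3.0, 1.0, 4.0], m, alpha)
v_higher_income = cd_indirect_utility(p, m + 10.0, alpha)

print(v0, v_scaled, v_higher_price, v_higher_income)
```

Here `v_scaled` should match `v0` (homogeneity of degree 0), `v_higher_price` should not exceed `v0`, and `v_higher_income` should not fall below it.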

2.5.2 The Expenditure Function and Hicksian Demand 

We note that if preferences satisfy the local nonsatiation assumption, then $v(p, m)$ will
be strictly increasing in $m$. We then can invert the function and solve for $m$ as a function
of the level of utility; that is, given any level of utility, $u$, we can find the minimal amount
of income necessary to achieve utility $u$ at prices $p$. The function that relates income
and utility in this way, the inverse of the indirect utility function, is known as the
expenditure function and is denoted by $e(p, u)$. Formally, the expenditure function is
given by the following problem:

$$e(p, u) = \min \ px$$
$$\text{such that } u(x) \geq u.$$

The expenditure function gives the minimum cost of achieving a fixed level of utility. The
solution, which is a function of $(p, u)$, is denoted by $h(p, u)$ and called the Hicksian
demand function. The Hicksian demand function tells us what consumption bundle
achieves a target level of utility and minimizes total expenditure.

A Hicksian demand function is sometimes called a compensated demand function. This 
terminology comes from viewing the demand function as being constructed by varying 
prices and income so as to keep the consumer at a fixed level of utility. Thus, the income 
changes are arranged to "compensate" for the price changes. Hicksian demand functions 
are not directly observable since they depend on utility, which is not directly observable. 
Demand functions expressed as a function of prices and income are observable; when we 
want to emphasize the difference between the Hicksian demand function and the usual 
demand function, we will refer to the latter as the Marshallian demand function, 
x(p, m). The Marshallian demand function is just the ordinary market demand function 
we have been discussing all along. 

Proposition 2.5.2 (Properties of the Expenditure Function) If $u(x)$ is continuous and
locally non-satiated on $\mathbb{R}^L_+$ and $(p, m) > 0$, the expenditure function has the following
properties:

(1) $e(p, u)$ is nondecreasing in $p$.

(2) $e(p, u)$ is homogeneous of degree 1 in $p$.

(3) $e(p, u)$ is concave in $p$.

(4) $e(p, u)$ is continuous in $p$, for $p \gg 0$.

(5) For all $p > 0$, $e(p, u)$ is strictly increasing in $u$.

(6) Shephard's lemma: If $h(p, u)$ is the expenditure-minimizing bundle necessary to
achieve utility level $u$ at prices $p$, then $h_i(p, u) = \frac{\partial e(p, u)}{\partial p_i}$ for $i = 1, \ldots, k$ assuming the
derivative exists and that $p_i > 0$.

Proof. Since the expenditure function is the inverse function of the indirect utility
function, Properties (1), (4)-(5) are true by Properties (1) and (4) of the indirect utility
given in Proposition 2.5.1. We only need to show the Properties (2), (3) and (6).

(2) We show that if $x$ is the expenditure-minimizing bundle at prices $p$, then $x$ also
minimizes the expenditure at prices $tp$. Suppose not, and let $x'$ be an expenditure
minimizing bundle at $tp$ so that $tpx' < tpx$. But this inequality implies $px' < px$,
which contradicts the definition of $x$. Hence, multiplying prices by a positive scalar
$t$ does not change the composition of an expenditure minimizing bundle, and, thus,
expenditures must rise by exactly a factor of $t$: $e(tp, u) = tpx = te(p, u)$.

(3) Let $(p, x)$ and $(p', x')$ be two expenditure-minimizing price-consumption combinations
and let $p'' = tp + (1 - t)p'$ for any $0 \leq t \leq 1$, with $x''$ an expenditure-minimizing
bundle at prices $p''$. Now,

$$e(p'', u) = p''x'' = tpx'' + (1 - t)p'x''.$$

Since $x''$ is not necessarily the minimal-expenditure bundle to reach $u$ at prices $p'$ or $p$, we
have $px'' \geq e(p, u)$ and $p'x'' \geq e(p', u)$. Thus,

$$e(p'', u) \geq te(p, u) + (1 - t)e(p', u).$$

(6) Let $x^*$ be an expenditure-minimizing bundle to achieve utility level $u$ at prices $p^*$.
Then define the function

$$g(p) = e(p, u) - px^*.$$

Since $e(p, u)$ is the cheapest way to achieve $u$, this function is always non-positive.
At $p = p^*$, $g(p^*) = 0$. Since this is a maximum value of $g(p)$, its derivative must
vanish:

$$\frac{\partial g(p^*)}{\partial p_i} = \frac{\partial e(p^*, u)}{\partial p_i} - x_i^* = 0, \quad i = 1, \ldots, k.$$

Hence, the expenditure-minimizing bundles are just given by the vector of derivatives
of the expenditure function with respect to the prices.
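Shephard's lemma can be checked numerically in a simple case. The sketch below assumes the two-good Cobb-Douglas utility $u(x) = x_1^a x_2^{1-a}$, for which inverting the indirect utility function gives $e(p, u) = u\,(p_1/a)^a (p_2/(1-a))^{1-a}$ and Hicksian demands that spend the shares $a$ and $1-a$ of $e(p,u)$ on the two goods. These closed forms, the parameter values, and the function names are assumptions introduced for illustration; the price derivative is approximated by a central finite difference.

```python
def expenditure(p1, p2, u, a):
    """e(p, u) for u(x) = x1**a * x2**(1-a), obtained by inverting v(p, m)."""
    return u * (p1 / a) ** a * (p2 / (1 - a)) ** (1 - a)

def hicksian(p1, p2, u, a):
    """Expenditure-minimizing bundle: share a of e(p, u) is spent on good 1."""
    e = expenditure(p1, p2, u, a)
    return a * e / p1, (1 - a) * e / p2

def d_expenditure_dp1(p1, p2, u, a, h=1e-5):
    """Central finite-difference approximation of de/dp1."""
    return (expenditure(p1 + h, p2, u, a) - expenditure(p1 - h, p2, u, a)) / (2 * h)

# Illustrative numbers (assumptions, not taken from the text).
p1, p2, u_bar, a = 2.0, 3.0, 10.0, 0.25
h1, h2 = hicksian(p1, p2, u_bar, a)
slope = d_expenditure_dp1(p1, p2, u_bar, a)

print(h1, slope)
```

The finite-difference slope of the expenditure function in $p_1$ agrees with the Hicksian demand for good 1 up to numerical error, and the Hicksian bundle attains exactly the target utility.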

Remark 2.5.1 We can also prove property (6) by applying the Envelope Theorem for
the constrained version. In this problem the parameter $a$ can be chosen to be one of the
prices, $p_i$. Define the Lagrange function $L(x, \lambda) = px - \lambda(u - u(x))$. The optimal value
function is the expenditure function $e(p, u)$. The envelope theorem asserts that

$$\frac{\partial e(p, u)}{\partial p_i} = \frac{\partial L}{\partial p_i} = x_i \Big|_{x_i = h_i(p, u)} = h_i(p, u),$$

which is simply Shephard's lemma.

We now give some basic properties of Hicksian demand functions:

Proposition 2.5.3 (Negative Semi-Definite Substitution Matrix) The matrix of
substitution terms $(\partial h_j(p, u)/\partial p_i)$ is negative semi-definite.

Proof. This follows from

$$\partial h_j(p, u)/\partial p_i = \partial^2 e(p, u)/\partial p_i \partial p_j,$$

which is negative semi-definite because the expenditure function is concave.

Since the substitution matrix is the matrix of second derivatives of the expenditure
function, it is symmetric; since it is negative semi-definite, it has non-positive diagonal
terms. Then we have

Proposition 2.5.4 (Symmetric Substitution Terms) The matrix of substitution terms
is symmetric, i.e.,

$$\frac{\partial h_j(p, u)}{\partial p_i} = \frac{\partial^2 e(p, u)}{\partial p_j \partial p_i} = \frac{\partial^2 e(p, u)}{\partial p_i \partial p_j} = \frac{\partial h_i(p, u)}{\partial p_j}.$$

Proposition 2.5.5 (Negative Own-Substitution Terms) The compensated own-price
effect is non-positive; that is, the Hicksian demand curves slope downward:

$$\frac{\partial h_i(p, u)}{\partial p_i} = \frac{\partial^2 e(p, u)}{\partial p_i^2} \leq 0.$$

2.5.3 The Money Metric Utility Functions 

There is a nice construction involving the expenditure function that comes up in a variety 
of places in welfare economics. Consider some prices p and some given bundle of goods 
x. We can ask the following question: how much money would a given consumer need at 
the prices p to be as well off as he could be by consuming the bundle of goods x? If we 
know the consumer's preferences, we can simply solve the following problem: 

$$m(p, x) = \min_{z} \ pz$$
$$\text{such that } u(z) \geq u(x).$$

That is,

$$m(p, x) = e(p, u(x)).$$

This type of function is called the money metric utility function. It is also known as
the "minimum income function," the "direct compensation function," and by a variety
of other names. For fixed $p$, $m(p, x)$ is simply a monotonic transform of the utility
function and is itself a utility function.

There is a similar construct for indirect utility known as the money metric indirect
utility function, which is given by

$$\mu(p; q, m) = e(p, v(q, m)).$$

That is, $\mu(p; q, m)$ measures how much money one would need at prices $p$ to be as well
off as one would be facing prices $q$ and having income $m$. Just as in the direct case,
$\mu(p; q, m)$ is simply a monotonic transformation of an indirect utility function.

Example 2.5.2 (The CES Utility Function) The CES utility function is given by
$u(x_1, x_2) = (x_1^{\rho} + x_2^{\rho})^{1/\rho}$, where $0 \neq \rho < 1$. It can be easily verified that this utility
function is strictly monotonic increasing and strictly quasi-concave. Since preferences are
invariant with respect to monotonic transforms of utility, we could just as well choose
$u(x_1, x_2) = \frac{1}{\rho} \ln(x_1^{\rho} + x_2^{\rho})$.

The first-order conditions are

$$\frac{x_1^{\rho-1}}{x_1^{\rho} + x_2^{\rho}} - \lambda p_1 = 0$$
$$\frac{x_2^{\rho-1}}{x_1^{\rho} + x_2^{\rho}} - \lambda p_2 = 0$$
$$p_1 x_1 + p_2 x_2 = m$$

Dividing the first equation by the second equation and then solving for $x_2$, we have

$$x_2 = x_1 \left( \frac{p_2}{p_1} \right)^{\frac{1}{\rho-1}}.$$

Substituting the above equation in the budget line and solving for $x_1$, we obtain

$$x_1(p, m) = \frac{p_1^{\frac{1}{\rho-1}} m}{p_1^{\frac{\rho}{\rho-1}} + p_2^{\frac{\rho}{\rho-1}}}$$

and thus

$$x_2(p, m) = \frac{p_2^{\frac{1}{\rho-1}} m}{p_1^{\frac{\rho}{\rho-1}} + p_2^{\frac{\rho}{\rho-1}}}.$$

Substituting the demand functions into the utility function, we get the indirect CES utility
function:

$$v(p, m) = \left( p_1^{\frac{\rho}{\rho-1}} + p_2^{\frac{\rho}{\rho-1}} \right)^{\frac{1-\rho}{\rho}} m$$

or

$$v(p, m) = (p_1^r + p_2^r)^{-1/r} m$$

where $r = \rho/(\rho - 1)$. Inverting the above equation, we obtain the expenditure function
for the CES utility function which has the form

$$e(p, u) = (p_1^r + p_2^r)^{1/r} u.$$

Consequently, the money metric direct and indirect utility functions are given by

$$m(p, x) = (p_1^r + p_2^r)^{1/r} (x_1^{\rho} + x_2^{\rho})^{1/\rho}$$

and

$$\mu(p; q, m) = (p_1^r + p_2^r)^{1/r} (q_1^r + q_2^r)^{-1/r} m.$$
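The CES formulas above can be cross-checked against each other numerically. The sketch below codes the demand, indirect utility, and expenditure functions and evaluates them at hypothetical values ($\rho = 0.5$, so $r = -1$); the function names and numbers are assumptions chosen only for illustration.

```python
def ces_demand(p1, p2, m, rho):
    """Marshallian demand for u = (x1**rho + x2**rho)**(1/rho), rho < 1, rho != 0."""
    r = rho / (rho - 1)
    denom = p1 ** r + p2 ** r
    # Note r - 1 = 1/(rho - 1), the exponent on own price in the numerator.
    return p1 ** (r - 1) * m / denom, p2 ** (r - 1) * m / denom

def ces_utility(x1, x2, rho):
    """Direct CES utility."""
    return (x1 ** rho + x2 ** rho) ** (1 / rho)

def ces_indirect_utility(p1, p2, m, rho):
    """v(p, m) = (p1**r + p2**r)**(-1/r) * m."""
    r = rho / (rho - 1)
    return (p1 ** r + p2 ** r) ** (-1 / r) * m

def ces_expenditure(p1, p2, u, rho):
    """e(p, u) = (p1**r + p2**r)**(1/r) * u."""
    r = rho / (rho - 1)
    return (p1 ** r + p2 ** r) ** (1 / r) * u

# Illustrative numbers (assumptions, not taken from the text).
p1, p2, m, rho = 2.0, 3.0, 120.0, 0.5
x1, x2 = ces_demand(p1, p2, m, rho)
v = ces_indirect_utility(p1, p2, m, rho)

print(x1, x2, v, ces_expenditure(p1, p2, v, rho))
```

With these numbers the demanded bundle exhausts the budget, the direct utility at that bundle equals `v`, and the expenditure needed to reach `v` recovers the original income.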

Remark 2.5.2 The CES utility function contains several other well-known utility functions
as special cases, depending on the value of the parameter $\rho$.

(1) The linear utility function ($\rho = 1$). Simple substitution yields

$$u = x_1 + x_2.$$

(2) The Cobb-Douglas utility function ($\rho = 0$). When $\rho = 0$ the CES utility function
is not defined, due to division by zero. However, we will show that as $\rho$ approaches
zero, the indifference curves of the CES utility function look very much like the
indifference curves of the Cobb-Douglas utility function.

This is easiest to see using the marginal rate of substitution. By direct calculation,

$$MRS = -\left( \frac{x_1}{x_2} \right)^{\rho - 1}. \qquad (2.12)$$

As $\rho$ approaches zero, this tends to a limit of

$$MRS = -\frac{x_2}{x_1},$$

which is simply the MRS for the Cobb-Douglas utility function.

(3) The Leontief utility function ($\rho = -\infty$). We have just seen that the MRS of
the CES utility function is given by equation (2.12). As $\rho$ approaches $-\infty$, this
expression approaches

$$MRS = -\left( \frac{x_1}{x_2} \right)^{-\infty} = -\left( \frac{x_2}{x_1} \right)^{\infty}.$$

If $x_2 > x_1$ the MRS is (negative) infinity; if $x_2 < x_1$ the MRS is zero. This means
that as $\rho$ approaches $-\infty$, a CES indifference curve looks like an indifference curve
associated with the Leontief utility function.

2.5.4 Some Important Identities 

There are some important identities that tie together the expenditure function, the indi- 
rect utility function, the Marshallian demand function, and the Hicksian demand function. 
Let us consider the utility maximization problem

$$v(p, m^*) = \max \ u(x) \qquad (2.13)$$
$$\text{such that } px \leq m^*.$$

Let $x^*$ be the solution to this problem and let $u^* = u(x^*)$. Consider the expenditure
minimization problem

$$e(p, u^*) = \min \ px \qquad (2.14)$$
$$\text{such that } u(x) \geq u^*.$$

An inspection of Figure 2.12 should convince you that the answers to these two problems
should be the same $x^*$. Formally, we have the following proposition.

Proposition 2.5.6 (Equivalence of Utility Max and Expenditure Min) Suppose the
utility function $u$ is continuous and locally non-satiated, and suppose that $m > 0$. If
solutions to both problems exist, then the above two problems have the same solution $x^*$.
That is,

1 Utility maximization implies expenditure minimization: Let $x^*$ be a solution
to (2.13), and let $u = u(x^*)$. Then $x^*$ solves (2.14).

2 Expenditure minimization implies utility maximization: Suppose that the
above assumptions are satisfied and that $x^*$ solves (2.14). Let $m = px^*$. Then $x^*$
solves (2.13).

Figure 2.12: Maximize utility and minimize expenditure are normally equivalent.

Proof.

1 Suppose not, and let $x'$ solve (2.14). Hence, $px' < px^*$ and $u(x') \geq u(x^*)$. By local
nonsatiation there is a bundle $x''$ close enough to $x'$ so that $px'' < px^* = m$ and
$u(x'') > u(x^*)$. But then $x^*$ cannot be a solution to (2.13).

2 Suppose not, and let $x'$ solve (2.13) so that $u(x') > u(x^*)$ and $px' = px^* = m$. Since
$px^* > 0$ and utility is continuous, we can find $0 < t < 1$ such that $ptx' < px^* = m$
and $u(tx') > u(x^*)$. Hence, $x^*$ cannot solve (2.14).

This proposition leads to four important identities that are summarized in the following
proposition.

Proposition 2.5.7 Suppose the utility function u is continuous and locally non-satiated, 
and suppose that m > 0. Then we have 

(1) e(p,t>(p,m)) = m. The minimum expenditure necessary to reach utility t>(p,m) is 
m. 

(2) t>(p,e(p,w)) = u. The maximum utility from income e(p,w) is u. 

(3) Xi(p,m) = hi(p,v(p,m)). The Marshallian demand at income m is the same as the 
Hicksian demand at utility v(p,m). 
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(4) hi(p,u) = Xj(p, e(p, u)). The Hicksian demand at utility u is the same as the 
Marshallian demand at income e(p,u). 



This last identity is perhaps the most important since it ties together the "observable" 
Marshallian demand function with the "unobservable" Hicksian demand function. Thus, 
any demanded bundle can be expressed either as the solution to the utility maximization 
problem or the expenditure minimization problem. 
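The four identities can be checked numerically for a concrete case. The following is a minimal sketch, assuming a two-good Cobb-Douglas utility $u(x_1, x_2) = x_1^{a} x_2^{1-a}$; the parameter `A`, the function names, and the test values are illustrative assumptions, not part of the notes.

```python
# Numerical check of identities (1)-(4) for a two-good Cobb-Douglas consumer.
# Assumption: u(x1, x2) = x1**A * x2**(1 - A); A is a hypothetical parameter.
A = 0.3


def marshallian(p1, p2, m):
    """Utility-maximizing bundle at prices (p1, p2) and income m."""
    return (A * m / p1, (1 - A) * m / p2)


def indirect_utility(p1, p2, m):
    """v(p, m): utility of the Marshallian bundle."""
    x1, x2 = marshallian(p1, p2, m)
    return x1 ** A * x2 ** (1 - A)


def hicksian(p1, p2, u):
    """Expenditure-minimizing bundle that reaches utility u."""
    ratio = A * p2 / ((1 - A) * p1)
    return (u * ratio ** (1 - A), u * ratio ** (-A))


def expenditure(p1, p2, u):
    """e(p, u): cost of the Hicksian bundle."""
    h1, h2 = hicksian(p1, p2, u)
    return p1 * h1 + p2 * h2


def check_identities(p1, p2, m, u, tol=1e-8):
    """Return four booleans, one for each of the identities (1)-(4)."""
    v = indirect_utility(p1, p2, m)
    e = expenditure(p1, p2, u)

    def close(a, b):
        return abs(a - b) < tol

    id1 = close(expenditure(p1, p2, v), m)
    id2 = close(indirect_utility(p1, p2, e), u)
    id3 = all(close(a, b) for a, b in
              zip(marshallian(p1, p2, m), hicksian(p1, p2, v)))
    id4 = all(close(a, b) for a, b in
              zip(hicksian(p1, p2, u), marshallian(p1, p2, e)))
    return (id1, id2, id3, id4)


if __name__ == "__main__":
    print(check_identities(2.0, 5.0, 100.0, 7.0))
```

Such a check only confirms the identities at the sampled prices, income, and utility level; it is not a substitute for the proof.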

A nice application of one of these identities is given in the next proposition: 

Roy's identity. If $x(p, m)$ is the Marshallian demand function, then

$$x_i(p, m) = -\frac{\partial v(p, m)/\partial p_i}{\partial v(p, m)/\partial m} \quad \text{for } i = 1, \ldots, k,$$

provided that the right-hand side is well defined and that $p_i > 0$ and $m > 0$.

Proof. Suppose that $x^*$ yields a maximal utility of $u^*$ at $(p^*, m^*)$. We know from our
identities that

$$x(p^*, m^*) = h(p^*, u^*). \tag{2.15}$$

From another one of the fundamental identities, we also know that

$$u^* = v(p, e(p, u^*)).$$

Since this is an identity we can differentiate it with respect to $p_i$ to get

$$0 = \frac{\partial v(p^*, m^*)}{\partial p_i} + \frac{\partial v(p^*, m^*)}{\partial m} \frac{\partial e(p^*, u^*)}{\partial p_i}.$$

Rearranging, and combining this with identity (2.15), we have

$$x_i(p^*, m^*) = h_i(p^*, u^*) = \frac{\partial e(p^*, u^*)}{\partial p_i} = -\frac{\partial v(p^*, m^*)/\partial p_i}{\partial v(p^*, m^*)/\partial m}.$$

Since this identity is satisfied for all $(p^*, m^*)$ and since $x^* = x(p^*, m^*)$, the result is
proved.

Example 2.5.3 (The General Cobb-Douglas Utility Function) Consider the indirect
Cobb-Douglas utility function:

$$v(p, m) = \prod_{l=1}^{L} \left(\frac{\alpha_l m}{\alpha p_l}\right)^{\alpha_l},$$

where $\alpha = \sum_{l=1}^{L} \alpha_l$. Then we have

$$v_{p_l} \equiv \frac{\partial v(p,m)}{\partial p_l} = -\frac{\alpha_l}{p_l} v(p,m),$$

$$v_m \equiv \frac{\partial v(p,m)}{\partial m} = \frac{\alpha}{m} v(p,m).$$

Thus, Roy's identity gives the demand functions as

$$x_l(p,m) = \frac{\alpha_l m}{\alpha p_l}, \quad l = 1, 2, \ldots, L.$$

Example 2.5.4 (The General Leontief Utility Function) Suppose a preference ordering
is represented by the Leontief utility function given by:

$$u(x) = \min\left\{\frac{x_1}{a_1}, \frac{x_2}{a_2}, \ldots, \frac{x_L}{a_L}\right\},$$

which is not itself differentiable, but the indirect utility function

$$v(p, m) = \frac{m}{ap}, \tag{2.16}$$

where $a = (a_1, a_2, \ldots, a_L)$ and $ap$ is the inner product, is differentiable. Applying
Roy's identity, we have

$$x_l(p,m) = -\frac{v_{p_l}(p,m)}{v_m(p,m)} = \frac{a_l m}{(ap)^2} \Big/ \frac{1}{ap} = \frac{a_l m}{ap}.$$

Hence, Roy's identity often works well even if the differentiability properties of the
statement do not hold.

Example 2.5.5 (The CES Utility Function) The CES utility function is given by
$u(x_1, x_2) = (x_1^{\rho} + x_2^{\rho})^{1/\rho}$. We have derived earlier that the indirect utility function is given
by:

$$v(p,m) = (p_1^r + p_2^r)^{-1/r} m.$$

The demand functions can be found by Roy's identity:

$$x_l(p,m) = -\frac{\partial v(p,m)/\partial p_l}{\partial v(p,m)/\partial m} = \frac{\frac{1}{r}(p_1^r + p_2^r)^{-(1 + \frac{1}{r})} m r p_l^{r-1}}{(p_1^r + p_2^r)^{-1/r}} = \frac{p_l^{r-1} m}{p_1^r + p_2^r}, \quad l = 1, 2.$$
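Roy's identity in the CES example can be confirmed numerically by differentiating the indirect utility function with central finite differences. This is a sketch under stated assumptions: the value `R = 0.5` for the exponent $r$, the step size, and the function names are illustrative choices and do not come from the notes.

```python
# Finite-difference check of Roy's identity for the CES indirect utility
# v(p, m) = (p1**R + p2**R) ** (-1/R) * m.
R = 0.5  # hypothetical value of the exponent r, chosen only for illustration


def v(p1, p2, m):
    """CES indirect utility function."""
    return (p1 ** R + p2 ** R) ** (-1.0 / R) * m


def roy_demand(p1, p2, m, step=1e-6):
    """Demand computed as -(dv/dp_l) / (dv/dm) with central differences."""
    dv_dp1 = (v(p1 + step, p2, m) - v(p1 - step, p2, m)) / (2 * step)
    dv_dp2 = (v(p1, p2 + step, m) - v(p1, p2 - step, m)) / (2 * step)
    dv_dm = (v(p1, p2, m + step) - v(p1, p2, m - step)) / (2 * step)
    return (-dv_dp1 / dv_dm, -dv_dp2 / dv_dm)


def closed_form_demand(p1, p2, m):
    """Closed-form CES demand: p_l**(R-1) * m / (p1**R + p2**R)."""
    denom = p1 ** R + p2 ** R
    return (p1 ** (R - 1) * m / denom, p2 ** (R - 1) * m / denom)


if __name__ == "__main__":
    print(roy_demand(2.0, 3.0, 10.0))
    print(closed_form_demand(2.0, 3.0, 10.0))
```

The two printed bundles should agree up to finite-difference error, and the closed-form bundle exhausts the budget, as budget balancedness requires.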
2.6 Duality Between Direct and Indirect Utility

We will see in Section 2.8 how one can recover an indirect utility function from observed
demand functions by solving the integrability equations. Here we see how to solve for the
direct utility function.

The answer exhibits quite nicely the duality between direct and indirect utility func- 
tions. It is most convenient to describe the calculations in terms of the normalized indirect 
utility function, where we have prices divided by income so that expenditure is identically 
one. Thus the normalized indirect utility function is given by 

$$v(p) = \max_x u(x) \quad \text{such that } px = 1.$$

We then have the following proposition.

Proposition 2.6.1 Given the indirect utility function v(p), the direct utility function can 
be obtained by solving the following problem: 

$$u(x) = \min_p v(p) \quad \text{such that } px = 1.$$

Proof. Let $x$ be the demanded bundle at the prices $p$. Then by definition $v(p) = u(x)$.
Let $p'$ be any other price vector that satisfies the budget constraint so that $p'x = 1$. Then
since $x$ is always a feasible choice at the prices $p'$, due to the form of the budget set, the
utility-maximizing choice must yield utility at least as great as the utility yielded by $x$;
that is, $v(p') \ge u(x) = v(p)$. Hence, the minimum of the indirect utility function over all
$p$'s that satisfy the budget constraint gives us the utility of $x$.

The argument is depicted in Figure 2.13. Any price vector $p$ that satisfies the budget
constraint $px = 1$ must yield a utility at least as high as $u(x)$, which is simply to say that $u(x)$
solves the minimization problem posed above.

Example 2.6.1 (Solving for the Direct Utility Function) Suppose that we have an
indirect utility function given by $v(p_1, p_2) = -a \ln p_1 - b \ln p_2$. What is its associated direct
utility function? We set up the minimization problem:

$$\min_{p_1, p_2} \; -a \ln p_1 - b \ln p_2 \quad \text{such that } p_1 x_1 + p_2 x_2 = 1.$$

Figure 2.13: Solving for the direct utility function.



The first-order conditions are

$$-a/p_1 = \lambda x_1$$
$$-b/p_2 = \lambda x_2,$$

or,

$$-a = \lambda p_1 x_1$$
$$-b = \lambda p_2 x_2.$$

Adding together and using the budget constraint yields

$$\lambda = -a - b.$$

Substitute back into the first-order conditions to find

$$p_1 = \frac{a}{(a+b)x_1}, \qquad p_2 = \frac{b}{(a+b)x_2}.$$

These are the choices of $(p_1, p_2)$ that minimize indirect utility. Now substitute these
choices into the indirect utility function:

$$u(x_1, x_2) = -a \ln \frac{a}{(a+b)x_1} - b \ln \frac{b}{(a+b)x_2} = a \ln x_1 + b \ln x_2 + \text{constant}.$$



This is the familiar Cobb-Douglas utility function. 
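The minimization in this example can also be carried out numerically. The sketch below is an illustration under stated assumptions: it uses the budget constraint to eliminate $p_2$ and then applies a golden-section search over $p_1$ (a numerical method chosen here for convenience, not one used in the notes); the function names and test values are hypothetical.

```python
import math


def normalized_indirect_utility(p1, p2, a, b):
    """v(p) = -a ln p1 - b ln p2, with prices already divided by income."""
    return -a * math.log(p1) - b * math.log(p2)


def recover_direct_utility(x1, x2, a, b, iterations=100):
    """Minimize v(p) subject to p1*x1 + p2*x2 = 1; return (u(x), minimizing p1)."""
    def objective(p1):
        p2 = (1.0 - p1 * x1) / x2  # the budget constraint pins down p2
        return normalized_indirect_utility(p1, p2, a, b)

    ratio = (math.sqrt(5.0) - 1.0) / 2.0
    lo, hi = 0.0, 1.0 / x1  # p1 in this open interval keeps p2 positive
    for _ in range(iterations):
        left = hi - ratio * (hi - lo)
        right = lo + ratio * (hi - lo)
        # The objective is convex in p1, so shrinking the bracket is safe.
        if objective(left) < objective(right):
            hi = right
        else:
            lo = left
    p1 = 0.5 * (lo + hi)
    return objective(p1), p1


if __name__ == "__main__":
    value, p1 = recover_direct_utility(2.0, 3.0, 1.0, 2.0)
    print(value, p1)
```

For $x = (2, 3)$, $a = 1$, $b = 2$, the minimizing price should be close to $a / ((a+b)x_1) = 1/6$, and the minimized value should match $a \ln x_1 + b \ln x_2$ plus the constant from the closed form.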
The duality between seemingly different ways of representing economic behavior is
useful in the study of consumer theory, welfare economics, and many other areas in 
economics. Many relationships that are difficult to understand when looked at directly 
become simple, or even trivial, when looked at using the tools of duality. 

2.7 Properties of Consumer Demand 

In this section we will examine the comparative statics of consumer demand behavior: 
how the consumer's demand changes as prices and income change. 

2.7.1 Income Changes and Consumption Choice 

It is of interest to look at how the consumer's demand changes as we hold prices fixed 
and allow income to vary; the resulting locus of utility-maximizing bundles is known 
as the income expansion path. From the income expansion path, we can derive a 
function that relates income to the demand for each commodity (at constant prices). 
These functions are called Engel curves. There are two possibilities: (1) As income
increases, the optimal consumption of a good increases. Such a good is called a normal
good. (2) As income increases, the optimal consumption of a good decreases. Such a
good is called an inferior good.

For the two-good consumer maximization problem, when the income expansion path
(and thus each Engel curve) is upward sloping, both goods are normal goods. When
the income expansion path bends backwards, there is one and only one good that
is inferior when the utility function is locally non-satiated; an increase in income means the
consumer actually wants to consume less of the good. (See Figure 2.14.)

2.7.2 Price Changes and Consumption Choice 

We can also hold income fixed and allow prices to vary. If we let $p_1$ vary and hold $p_2$ and
$m$ fixed, the locus of tangencies will sweep out a curve known as the price offer curve.
In the first case in Figure 2.15 we have the ordinary case where a lower price for good
1 leads to greater demand for the good so that the Law of Demand is satisfied; in the
second case we have a situation where a decrease in the price of good 1 brings about a
decreased demand for good 1. Such a good is called a Giffen good.

Figure 2.14: Income expansion paths with an inferior good.




Figure 2.15: Offer curves. In panel A the demand for good 1 increases as the price 
decreases so it is an ordinary good. In panel B the demand for good 1 decreases as its 
price decreases, so it is a Giffen good. 



2.7.3 Income-Substitution Effect: The Slutsky Equation 

In the above we see that a fall in the price of a good may have two sorts of effects: sub- 
stitution effect — one commodity will become less expensive than another, and income 
effect — total "purchasing power" increases. A fundamental result of the theory of the 
consumer, the Slutsky equation, relates these two effects. 
Even though the compensated demand function is not directly observable, we shall see
that its derivative can be easily calculated from observable things, namely, the derivative 
of the Marshallian demand with respect to price and income. This relationship is known 
as the Slutsky equation. 

Slutsky equation.

$$\frac{\partial x_j(p,m)}{\partial p_i} = \frac{\partial h_j(p, v(p,m))}{\partial p_i} - \frac{\partial x_j(p,m)}{\partial m} x_i(p,m).$$

Proof. Let $x^*$ maximize utility at $(p^*, m^*)$ and let $u^* = u(x^*)$. It is identically true that

$$h_j(p, u^*) = x_j(p, e(p, u^*)).$$

We can differentiate this with respect to $p_i$ and evaluate the derivative at $p^*$ to get

$$\frac{\partial h_j(p^*, u^*)}{\partial p_i} = \frac{\partial x_j(p^*, m^*)}{\partial p_i} + \frac{\partial x_j(p^*, m^*)}{\partial m} \frac{\partial e(p^*, u^*)}{\partial p_i}.$$

Note carefully the meaning of this expression. The left-hand side is how the compensated
demand changes when $p_i$ changes. The right-hand side says that this change is equal to
the change in demand holding expenditure fixed at $m^*$ plus the change in demand when
income changes times how much income has to change to keep utility constant. But this
last term, $\partial e(p^*, u^*)/\partial p_i$, is just $x_i^*$; rearranging gives us

$$\frac{\partial x_j(p^*, m^*)}{\partial p_i} = \frac{\partial h_j(p^*, u^*)}{\partial p_i} - \frac{\partial x_j(p^*, m^*)}{\partial m} x_i^*,$$

which is the Slutsky equation.

There are other ways to derive Slutsky's equation that can be found in Varian (1992).

The Slutsky equation decomposes the demand change induced by a price change $\Delta p_i$
into two separate effects: the substitution effect and the income effect:

$$\Delta x_j \approx \frac{\partial x_j(p,m)}{\partial p_i} \Delta p_i = \frac{\partial h_j(p,u)}{\partial p_i} \Delta p_i - \frac{\partial x_j(p,m)}{\partial m} x_i^* \Delta p_i.$$

As we mentioned previously, the restrictions on the Hicksian demand functions
are not directly observable. However, as indicated by the Slutsky equation, we can express
the derivatives of $h$ with respect to $p$ as derivatives of $x$ with respect to $p$ and $m$, and
these are observable.

Figure 2.16: The Hicks decomposition of a demand change into two effects: the substitution
effect and the income effect. (Horizontal axis: good 1.)

Also, Slutsky's equation and the negative semi-definiteness of the substitution matrix of the
Hicksian demand functions given in Proposition 2.5.3 give us the following result on the
Marshallian demand functions:

Proposition 2.7.1 The substitution matrix $\left(\frac{\partial x_j(p,m)}{\partial p_i} + \frac{\partial x_j(p,m)}{\partial m} x_i\right)$ is a symmetric,
negative semi-definite matrix.

This is a rather nonintuitive result: a particular combination of price and income 
derivatives has to result in a negative semidefinite matrix. 

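Example 2.7.1 (The Cobb-Douglas Slutsky equation) Let us check the Slutsky
equation in the Cobb-Douglas case. As we've seen, in this case we have

$$v(p_1, p_2, m) = m p_1^{-a} p_2^{a-1}$$
$$e(p_1, p_2, u) = u p_1^{a} p_2^{1-a}$$
$$x_1(p_1, p_2, m) = \frac{am}{p_1}$$
$$h_1(p_1, p_2, u) = a p_1^{a-1} p_2^{1-a} u.$$

Thus

$$\frac{\partial x_1(p,m)}{\partial p_1} = -\frac{am}{p_1^2}$$
$$\frac{\partial x_1(p,m)}{\partial m} = \frac{a}{p_1}$$
$$\frac{\partial h_1(p,u)}{\partial p_1} = a(a-1) p_1^{a-2} p_2^{1-a} u$$
$$\frac{\partial h_1(p, v(p,m))}{\partial p_1} = a(a-1) p_1^{a-2} p_2^{1-a} m p_1^{-a} p_2^{a-1} = a(a-1) p_1^{-2} m.$$

Now plug into the Slutsky equation to find

$$\frac{\partial h_1}{\partial p_1} - \frac{\partial x_1}{\partial m} x_1 = \frac{a(a-1)m}{p_1^2} - \frac{a}{p_1} \frac{am}{p_1} = \frac{[a(a-1) - a^2] m}{p_1^2} = -\frac{am}{p_1^2} = \frac{\partial x_1}{\partial p_1}.$$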
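The same check can be done numerically, which is a useful habit when the algebra becomes heavier. The following sketch evaluates both sides of the Slutsky equation for good 1 by central finite differences, using the Cobb-Douglas forms of this example; the parameter value `A = 0.4`, the step size, and the function names are assumptions made for illustration.

```python
# Finite-difference check of the own-price Slutsky equation for good 1,
# using the Cobb-Douglas forms v = m p1^-a p2^(a-1) and h1 = a p1^(a-1) p2^(1-a) u.
A = 0.4  # hypothetical Cobb-Douglas exponent


def x1(p1, p2, m):
    """Marshallian demand for good 1."""
    return A * m / p1


def v(p1, p2, m):
    """Indirect utility function."""
    return m * p1 ** (-A) * p2 ** (A - 1)


def h1(p1, p2, u):
    """Hicksian demand for good 1."""
    return A * p1 ** (A - 1) * p2 ** (1 - A) * u


def slutsky_sides(p1, p2, m, step=1e-6):
    """Return (dx1/dp1, dh1/dp1 - dx1/dm * x1), which should coincide."""
    u = v(p1, p2, m)  # utility level held fixed along the compensated demand
    dx_dp = (x1(p1 + step, p2, m) - x1(p1 - step, p2, m)) / (2 * step)
    dx_dm = (x1(p1, p2, m + step) - x1(p1, p2, m - step)) / (2 * step)
    dh_dp = (h1(p1 + step, p2, u) - h1(p1 - step, p2, u)) / (2 * step)
    return dx_dp, dh_dp - dx_dm * x1(p1, p2, m)


if __name__ == "__main__":
    print(slutsky_sides(2.0, 3.0, 10.0))
```

Note that the compensated derivative is taken with $u$ fixed at $v(p, m)$ before perturbing the price, which mirrors the step in the example where $u$ is replaced by $v(p,m)$ only after differentiating.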



2.7.4 Continuity and Differentiability of Demand Functions 

Up until now we have assumed that the demand functions are nicely behaved; that is, that 
they are continuous and even differentiable functions. Are these assumptions justifiable? 

Proposition 2.7.2 (Continuity of Demand Function) Suppose $\succsim$ is continuous and
weakly convex, and $(p,m) > 0$. Then $x(p,m)$ is an upper hemi-continuous convex-valued
correspondence. Furthermore, if the weak convexity is replaced by the strict convexity,
$x(p,m)$ is a continuous single-valued function.

Proof. First note that, since $(p,m) > 0$, one can show that the budget constraint
set $B(p,m)$ is a continuous correspondence with non-empty and compact values and $\succsim$
is continuous. Then, by the Maximum Theorem, we know the demand correspondence
$x(p,m)$ is upper hemi-continuous. We now show $x(p,m)$ is convex-valued. Suppose $x$ and $x'$ are
two optimal consumption bundles. Let $x_t = tx + (1-t)x'$ for $t \in [0, 1]$. Then $x_t$ also
satisfies the budget constraint, and by weak convexity of $\succsim$, we have $x_t = tx + (1-t)x' \succsim x$.
Because $x$ is an optimal consumption bundle, we must have $x_t \sim x$ and thus $x_t$ is also an
optimal consumption bundle.

Now, when $\succsim$ is strictly convex, $x(p,m)$ is then single-valued by Proposition 2.4.2,
and thus it is a continuous function since an upper hemi-continuous correspondence is a
continuous function when it is single-valued.

A demand correspondence may not be continuous for non-convex preference ordering, 
as illustrated in Figure 2.17. Note that, in the case depicted in Figure 2.17, a small 
change in the price brings about a large change in the demanded bundles: the demand 
correspondence is discontinuous. 



Figure 2.17: Discontinuous demand. Demand is discontinuous due to non-convex preferences.
(Horizontal axis: quantity of good 1.)

Sometimes, we need to consider the slopes of demand curves and hence we would like
a demand function to be differentiable. What conditions can guarantee the differentiability?
We give the following proposition without proof.

Proposition 2.7.3 Suppose $x > 0$ solves the consumer's utility maximization problem at
$(p,m) > 0$. If

(1) $u$ is twice continuously differentiable on $\mathbb{R}^L_{++}$,

(2) $\frac{\partial u(x)}{\partial x_l} > 0$ for some $l = 1, \ldots, L$,

(3) the bordered Hessian of $u$ has nonzero determinant at $x$,

then $x(p,m)$ is differentiable at $(p,m)$.
2.7.5 Inverse Demand Functions

In many applications it is of interest to express demand behavior by describing prices as 
a function of quantities. That is, given some vector of goods x, we would like to find a 
vector of prices p and an income m at which x would be the demanded bundle. 

Since demand functions are homogeneous of degree zero, we can fix income at some 
given level, and simply determine prices relative to this income level. The most convenient 
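choice is to fix $m = 1$. In this case the first-order conditions for the utility maximization
problem are simply

$$\frac{\partial u(x)}{\partial x_i} - \lambda p_i = 0 \quad \text{for } i = 1, \ldots, k,$$

$$\sum_{i=1}^{k} p_i x_i = 1.$$

We want to eliminate $\lambda$ from this set of equations.

To do so, multiply each of the first set of equalities by $x_i$ and sum them over the
number of goods to get

$$\sum_{i=1}^{k} \frac{\partial u(x)}{\partial x_i} x_i = \lambda \sum_{i=1}^{k} p_i x_i = \lambda.$$

Substitute the value of $\lambda$ back into the first expression to find $p$ as a function of $x$:

$$p_i(x) = \frac{\partial u(x)/\partial x_i}{\sum_{j=1}^{k} \frac{\partial u(x)}{\partial x_j} x_j}. \tag{2.17}$$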

Given any vector of demands x, we can use this expression to find the price vector 
p(x) which will satisfy the necessary conditions for maximization. If the utility function 
is quasi-concave so that these necessary conditions are indeed sufficient for maximization, 
then this will give us the inverse demand relationship. 
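Formula (2.17) is easy to evaluate for any differentiable utility function. The sketch below approximates the gradient by central differences and normalizes it as in (2.17); the helper name `inverse_demand`, the step size, and the Cobb-Douglas test utility are assumptions introduced for illustration.

```python
def inverse_demand(u, x, step=1e-6):
    """Normalized prices p(x) from (2.17): gradient of u divided by sum_j u_j * x_j.

    `u` takes a list of quantities and returns a utility level; the gradient
    is approximated by central finite differences.
    """
    grad = []
    for i in range(len(x)):
        up = list(x)
        down = list(x)
        up[i] += step
        down[i] -= step
        grad.append((u(up) - u(down)) / (2 * step))
    denom = sum(g * xi for g, xi in zip(grad, x))
    return [g / denom for g in grad]


def cobb_douglas(x):
    """Hypothetical test utility u(x1, x2) = x1**0.3 * x2**0.7."""
    return x[0] ** 0.3 * x[1] ** 0.7


if __name__ == "__main__":
    prices = inverse_demand(cobb_douglas, [2.0, 5.0])
    print(prices)
```

For this Cobb-Douglas utility the formula reduces to $p_1 = 0.3/x_1$ and $p_2 = 0.7/x_2$, so at $x = (2, 5)$ the computed prices should be close to $(0.15, 0.14)$, and the bundle costs exactly one unit of (normalized) income.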

What happens if the utility function is not everywhere quasi-concave? Then there 
may be some bundles of goods that will not be demanded at any price; any bundle on a 
non-convex part of an indifference curve will be such a bundle. 

There is a dual version of the above formula for inverse demands that can be obtained 
from the duality between direct utility function and indirect utility function we discussed 
earlier. The argument given there shows that the demanded bundle x must minimize 
indirect utility over all prices that satisfy the budget constraint. Thus $x$ must satisfy the
first-order conditions

$$\frac{\partial v(p)}{\partial p_l} - \mu x_l = 0 \quad \text{for } l = 1, \ldots, L,$$

$$\sum_{l=1}^{L} p_l x_l = 1.$$

Now multiply each of the first equations by $p_l$ and sum them to find that
$\mu = \sum_{l=1}^{L} \frac{\partial v(p)}{\partial p_l} p_l$. Substituting this back into the first-order conditions, we have an expression
for the demanded bundle as a function of the normalized indirect utility function:

$$x_i(p) = \frac{\partial v(p)/\partial p_i}{\sum_{j=1}^{L} \frac{\partial v(p)}{\partial p_j} p_j}. \tag{2.18}$$

Note the nice duality: the expression for the direct demand function, (2.18), and the 
expression for the inverse demand function (2.17) have the same form. This expression 
can also be derived from the definition of the normalized indirect utility function and 
Roy's identity. 



2.8 The Integrability Problem 

Given a system of demand functions $x(p,m)$, is there necessarily a utility function from
which these demand functions can be derived? This question is known as the integrability
problem. We will show how to solve this problem by solving a differential equation
and integrating back, ultimately to the utility function. The Slutsky matrix plays a key 
role in this process. 

We have seen that the utility maximization hypothesis imposes certain observable 
restrictions on consumer behavior. If a demand function x(p,m) is well-behaved, our 
previous analysis has shown that x(p, m) satisfies the following five conditions: 

1. Nonnegativity: $x(p,m) \ge 0$.

2. Homogeneity: $x(tp, tm) = x(p,m)$.

3. Budget Balancedness: $px(p,m) = m$.

4. Symmetry: The Slutsky matrix $S = \left(\frac{\partial x_i(p,m)}{\partial p_j} + \frac{\partial x_i(p,m)}{\partial m} x_j(p,m)\right)$ is symmetric.

5. Negative Semi-definite: The matrix $S$ is negative semi-definite.

The main result of the integrability problem is that these conditions, together with
some technical assumptions, are in fact sufficient as well as necessary for the integrability
process, as shown by Hurwicz and Uzawa (1971). This result is very important from the
point of view of political economy. The utility maximization approach to the study of
consumer behavior is sometimes criticized on the grounds that the notion of utility is
a psychological measurement that cannot be observed, so that demand functions derived
from utility maximization are meaningless. The integrability result, however,
tells us that a utility function can be derived from observable data on demand although
the utility function is not directly observable. This impressive result warrants a formal
statement.

Theorem 2.8.1 A continuously differentiable function $x : \mathbb{R}^{L+1}_{++} \to \mathbb{R}^{L}_{+}$ is the demand
function generated by some increasing, quasi-concave utility function $u$ if (and only if,
when $u$ is continuous, strictly increasing and strictly quasi-concave) it satisfies homogeneity,
budget balancedness, symmetry, and negative semi-definiteness.

The proof of the theorem is somewhat complicated and can be found in Hurwicz and
Uzawa (1971), so it is omitted here.

To actually find a utility function from a given system of demand functions, we must
find an equation to integrate. As it turns out, it is somewhat easier to deal with the 
integrability problem in terms of the expenditure function rather than the indirect utility 
function. 

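Recall that from Shephard's lemma given in Proposition 2.5.2,

$$\frac{\partial e(p,u)}{\partial p_i} = x_i(p,m) = x_i(p, e(p,u)), \quad i = 1, \ldots, L. \tag{2.19}$$

We also specify a boundary condition of the form $e(p^*, u) = c$ where $p^*$ and $c$ are given.
The system of equations given in (2.19) is a system of partial differential equations. It
is well-known that a system of partial differential equations of the form

$$\frac{\partial f(p)}{\partial p_i} = g_i(p), \quad i = 1, \ldots, k,$$

has a (local) solution if and only if

$$\frac{\partial g_i(p)}{\partial p_j} = \frac{\partial g_j(p)}{\partial p_i} \quad \text{for all } i \text{ and } j.$$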
Applying this condition to the above problem, we see that it reduces to requiring that
the matrix

$$\left(\frac{\partial x_i(p,m)}{\partial p_j} + \frac{\partial x_i(p,m)}{\partial m} \frac{\partial e(p,u)}{\partial p_j}\right)$$

is symmetric. But this is just the Slutsky restriction! Thus the Slutsky restrictions imply
that the demand functions can be "integrated" to find an expenditure function consistent
with the observed choice behavior.

Under the assumption that all five of the properties listed at the beginning of this
section hold, the solution function $e$ will be an expenditure function. Inverting the found
expenditure function, we can find the indirect utility function. Then, using the duality
between the direct utility function and the indirect utility function studied in Section 2.6,
we can determine the direct utility function.

Example 2.8.1 (The General Cobb-Douglas Utility Function) Consider the demand
functions

$$x_i(p,m) = \frac{a_i m}{p_i} = \frac{\alpha_i m}{\alpha p_i},$$

where $\alpha = \sum_{l=1}^{L} \alpha_l$. The system (2.19) becomes

$$\frac{\partial e(p,u)}{\partial p_i} = \frac{\alpha_i m}{\alpha p_i}, \quad \text{with } m = e(p,u). \tag{2.20}$$

The $i$-th equation can be integrated with respect to $p_i$ to obtain

$$\ln e(p,u) = \frac{\alpha_i}{\alpha} \ln p_i + c_i,$$

where $c_i$ does not depend on $p_i$, but it may depend on $p_j$ for $j \ne i$. Thus, combining these
equations we find

$$\ln e(p,u) = \sum_{i=1}^{L} \frac{\alpha_i}{\alpha} \ln p_i + c,$$

where $c$ is independent of all $p_i$'s. The constant $c$ represents the freedom that we have
in setting the boundary condition. For each $u$, let us take $p^* = (1, \ldots, 1)$ and use the
boundary condition $e(p^*, u) = u$. Then it follows that

$$\ln e(p,u) = \sum_{i=1}^{L} \frac{\alpha_i}{\alpha} \ln p_i + \ln u.$$

Inverting the above equation, we have

$$\ln v(p,m) = -\sum_{i=1}^{L} \frac{\alpha_i}{\alpha} \ln p_i + \ln m,$$

which is a monotonic transformation of the indirect utility function for a Cobb-Douglas
we found previously.
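One way to gain confidence in an integrated expenditure function is to verify that it satisfies the system (2.19) and the boundary condition. The following sketch does this by finite differences for the Cobb-Douglas case above; the weights in `ALPHA`, the test prices, and the function names are hypothetical values chosen only for illustration.

```python
# Check that e(p, u) = u * prod_i p_i**(alpha_i / alpha) solves
# de/dp_i = x_i(p, e(p, u)) with boundary condition e((1, ..., 1), u) = u.
ALPHA = [1.0, 2.0, 3.0]  # hypothetical Cobb-Douglas weights


def demand(p, m):
    """Cobb-Douglas demands x_i = alpha_i * m / (alpha * p_i)."""
    total = sum(ALPHA)
    return [a * m / (total * pi) for a, pi in zip(ALPHA, p)]


def expenditure(p, u):
    """Expenditure function obtained by integrating (2.20)."""
    total = sum(ALPHA)
    e = u
    for a, pi in zip(ALPHA, p):
        e *= pi ** (a / total)
    return e


def pde_residuals(p, u, step=1e-6):
    """Finite-difference de/dp_i minus x_i(p, e(p, u)) for each good i."""
    x = demand(p, expenditure(p, u))
    residuals = []
    for i in range(len(p)):
        up = list(p)
        down = list(p)
        up[i] += step
        down[i] -= step
        de_dpi = (expenditure(up, u) - expenditure(down, u)) / (2 * step)
        residuals.append(de_dpi - x[i])
    return residuals


if __name__ == "__main__":
    print(pde_residuals([2.0, 3.0, 4.0], 5.0))
    print(expenditure([1.0, 1.0, 1.0], 5.0))
```

Residuals near zero indicate that the candidate function is consistent with the demand system at the sampled point; the second printed value checks the boundary condition at $p^* = (1, \ldots, 1)$.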

2.9 Revealed Preference 

2.9.1 Axioms of Revealed Preferences 

The basic preference axioms sometimes are criticized as being too strong on the grounds 
that individuals are unlikely to make choices through conscious use of a preference relation. 
One response to this criticism is to develop an alternative theory on the basis of a weaker 
set of hypotheses. One of the most interesting alternative theories is that of revealed 
preference, which is discussed in this section. 

The basic principle of revealed preference theory is that preference statements should
be constructed only from observable decisions, that is, from actual choices made by a
consumer. An individual preference relation, even if it exists, can never be directly observed
in the market. The best that we may hope for in practice is a list of the choices made
under different circumstances. For example, we may have some observations on consumer
behavior that take the form of a list of prices, $p^t$, and the associated chosen consumption
bundles, $x^t$, for $t = 1, \ldots, T$. How can we tell whether these data could have been generated
by a utility-maximizing consumer? Revealed preference theory focuses on the choices
made by a consumer, not on a hidden preference relation.

We will say that a utility function rationalizes the observed behavior $(p^t, x^t)$ for $t =
1, \ldots, T$ if $u(x^t) \ge u(x)$ for all $x$ such that $p^t x^t \ge p^t x$. That is, $u(x)$ rationalizes
the observed behavior if it achieves its maximum value on the budget set at the chosen
bundles. Suppose that the data were generated by such a maximization process. What
observable restrictions must the observed choices satisfy?

Without any assumptions about $u(x)$ there is a trivial answer to this question, namely,
no restrictions. For suppose that $u(x)$ were a constant function, so that the consumer
was indifferent to all observed consumption bundles. Then there would be no restrictions
imposed on the patterns of observed choices: anything is possible.

To make the problem interesting, we have to rule out this trivial case. The easiest
way to do this is to require the underlying utility function to be locally non-satiated. Our
question now becomes: what are the observable restrictions imposed by the maximization
of a locally non-satiated utility function?

Direct Revealed Preference: If $p^t x^t \ge p^t x$, then $u(x^t) \ge u(x)$. We will say that
$x^t$ is directly revealed preferred to $x$, and write $x^t R^D x$.

This condition means that if $x^t$ was chosen when $x$ could have been chosen, the utility
of $x^t$ must be at least as large as the utility of $x$. As a consequence of this definition and
the assumption that the data were generated by utility maximization, we can conclude
that "$x^t R^D x$ implies $u(x^t) \ge u(x)$."

Strictly Direct Revealed Preference: If $p^t x^t > p^t x$, then $u(x^t) > u(x)$. We will
say that $x^t$ is strictly directly revealed preferred to $x$ and write $x^t P^D x$.

It is not hard to show that local non-satiation implies this conclusion. For we know
from the previous paragraph that $u(x^t) \ge u(x)$; if $u(x^t) = u(x)$, then by local
non-satiation there would exist some other $x'$ close enough to $x$ so that $p^t x^t > p^t x'$ and
$u(x') > u(x) = u(x^t)$. This contradicts the hypothesis of utility maximization.

Revealed Preference: $x^t$ is said to be revealed preferred to $x$ if there exists a finite
number of bundles $x^1, x^2, \ldots, x^n$ such that $x^t R^D x^1$, $x^1 R^D x^2$, ..., $x^n R^D x$. In this case, we
write $x^t R x$.

The relation $R$ constructed above by considering chains of $R^D$ is sometimes called the
transitive closure of the relation $R^D$. If we assume that the data were generated by
utility maximization, it follows that "$x^t R x$ implies $u(x^t) \ge u(x)$."

Consider two observations $x^t$ and $x^s$. We now have a way to determine whether $u(x^t) \ge
u(x^s)$ and an observable condition to determine whether $u(x^s) > u(x^t)$. Obviously, these
two conditions should not both be satisfied. This condition can be stated as the

GENERALIZED AXIOM OF REVEALED PREFERENCE (GARP): If $x^t$ is
revealed preferred to $x^s$, then $x^s$ cannot be strictly directly revealed preferred to $x^t$.

Using the symbols defined above, we can also write this axiom as

GARP: $x^t R x^s$ implies not $x^s P^D x^t$. In other words, $x^t R x^s$ implies $p^s x^s \le p^s x^t$.

As the name implies, GARP is a generalization of various other revealed preference
tests. Here are two standard conditions.

WEAK AXIOM OF REVEALED PREFERENCE (WARP): If $x^t R^D x^s$ and $x^t$
is not equal to $x^s$, then it is not the case that $x^s R^D x^t$, i.e., $p^t x^t \ge p^t x^s$ implies $p^s x^t > p^s x^s$.

STRONG AXIOM OF REVEALED PREFERENCE (SARP): If $x^t R x^s$ and
$x^t$ is not equal to $x^s$, then it is not the case that $x^s R x^t$.

Each of these axioms requires that there be a unique demand bundle at each budget,
while GARP allows for multiple demanded bundles. Thus, GARP allows for flat spots in
the indifference curves that generated the observed choices.
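GARP can be tested mechanically on a finite data set. The sketch below builds the direct relation $R^D$ from the expenditure comparisons, takes its transitive closure with Warshall's algorithm (an implementation choice made here, not something prescribed by the notes), and then looks for a pair that violates the axiom. The function names and the sample data are hypothetical.

```python
def dot(p, x):
    """Inner product of a price vector and a bundle."""
    return sum(pi * xi for pi, xi in zip(p, x))


def satisfies_garp(prices, bundles):
    """Return True if the observations (p^t, x^t) satisfy GARP."""
    T = len(prices)
    # Direct revealed preference: x^t R^D x^s when p^t x^t >= p^t x^s.
    R = [[dot(prices[t], bundles[t]) >= dot(prices[t], bundles[s])
          for s in range(T)] for t in range(T)]
    # Transitive closure of R^D (Warshall's algorithm).
    for k in range(T):
        for t in range(T):
            for s in range(T):
                if R[t][k] and R[k][s]:
                    R[t][s] = True
    # GARP: x^t R x^s must imply p^s x^s <= p^s x^t.
    for t in range(T):
        for s in range(T):
            if R[t][s] and dot(prices[s], bundles[s]) > dot(prices[s], bundles[t]):
                return False
    return True


if __name__ == "__main__":
    consistent = satisfies_garp([(1, 2), (2, 1)], [(4, 1), (1, 4)])
    violating = satisfies_garp([(1, 2), (2, 1)], [(1, 4), (4, 1)])
    print(consistent, violating)
```

In the first data set neither bundle is affordable when the other is chosen, so no revealed preference relation arises between them. In the second, each bundle is chosen while the other is strictly cheaper, so each is strictly directly revealed preferred to the other and GARP fails.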

2.9.2 Characterization of Revealed Preference Maximization 

If the data $(p^t, x^t)$ were generated by a utility-maximizing consumer with nonsatiated
preferences, the data must satisfy GARP. Hence, GARP is an observable consequence of 
utility maximization. But does it express all the implications of that model? If some data 
satisfy this axiom, is it necessarily true that it must come from utility maximization, or at 
least be thought of in that way? Is GARP a sufficient condition for utility maximization? 

It turns out that it is. If a finite set of data is consistent with GARP, then there exists 
a utility function that rationalizes the observed behavior — i.e., there exists a utility 
function that could have generated that behavior. Hence, GARP exhausts the list of 
restrictions imposed by the maximization model. 

We state the following theorem without proof. 

Afriat's theorem. Let $(p^t, x^t)$ for $t = 1, \ldots, T$ be a finite number of observations of
price vectors and consumption bundles. Then the following conditions are equivalent.

(1) There exists a locally nonsatiated utility function that rationalizes the data;

(2) The data satisfy GARP;

(3) There exist positive numbers $(u^t, \lambda^t)$ for $t = 1, \ldots, T$ that satisfy the Afriat
inequalities:

$$u^s \le u^t + \lambda^t p^t (x^s - x^t) \quad \text{for all } t, s;$$

(4) There exists a locally nonsatiated, continuous, concave, monotonic utility function 
that rationalizes the data. 

Thus, Afriat's theorem states that a finite set of observed price and quantity data 
satisfy GARP if and only if there exists a locally non-satiated, continuous, increasing, 
and concave utility function that rationalizes the data. 




Condition (3) in Afriat's theorem has a natural interpretation. Suppose that $u(x)$ is
a concave, differentiable utility function that rationalizes the observed choices. The fact
that $u(x)$ is differentiable implies it must satisfy the $T$ first-order conditions

$$Du(x^t) = \lambda^t p^t. \tag{2.21}$$

The fact that $u(x)$ is concave implies that it must satisfy the concavity conditions

$$u(x^t) \le u(x^s) + Du(x^s)(x^t - x^s). \tag{2.22}$$

Substituting from (2.21) into (2.22), we have

$$u(x^t) \le u(x^s) + \lambda^s p^s (x^t - x^s).$$

Hence, the Afriat numbers $u^t$ and $\lambda^t$ can be interpreted as utility levels and marginal
utilities that are consistent with the observed choices.
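Given candidate Afriat numbers, condition (3) is a finite list of inequalities that can be verified directly. The sketch below only checks a proposed pair of vectors $(u^t)$ and $(\lambda^t)$; it does not search for them (that would require solving a system of linear inequalities). The function name, the tolerance, and the sample numbers are assumptions made for illustration.

```python
def afriat_inequalities_hold(prices, bundles, u, lam, tol=1e-12):
    """Check u^s <= u^t + lam^t * p^t (x^s - x^t) for all t, s.

    `u` and `lam` are candidate Afriat numbers and must be strictly positive.
    """
    T = len(prices)
    if any(value <= 0 for value in u) or any(value <= 0 for value in lam):
        return False
    for t in range(T):
        for s in range(T):
            # p^t (x^s - x^t): extra cost of bundle s at the prices of period t.
            gap = sum(p * (xs - xt)
                      for p, xs, xt in zip(prices[t], bundles[s], bundles[t]))
            if u[s] > u[t] + lam[t] * gap + tol:
                return False
    return True


if __name__ == "__main__":
    prices = [(1, 2), (2, 1)]
    print(afriat_inequalities_hold(prices, [(4, 1), (1, 4)], [1.0, 1.0], [1.0, 1.0]))
    print(afriat_inequalities_hold(prices, [(1, 4), (4, 1)], [1.0, 1.0], [1.0, 1.0]))
```

For the first data set the numbers $u^t = \lambda^t = 1$ work, since each bundle costs 3 more than the chosen one at the other period's prices. For the second data set the two inequalities would require $u^2 \le u^1 - 3\lambda^1$ and $u^1 \le u^2 - 3\lambda^2$, which cannot both hold with positive $\lambda^t$, so no Afriat numbers exist, consistent with the equivalence of (2) and (3).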




Figure 2.18: Concave function. 



The inequality holds for a concave function because, from Figure 2.18, we have

$$\frac{u(x^t) - u(x^s)}{x^t - x^s} \le u'(x^s). \tag{2.23}$$

Thus, we have $u(x^t) \le u(x^s) + u'(x^s)(x^t - x^s)$.
The most remarkable implication of Afriat's theorem is that (1) implies (4): if there
is any locally nonsatiated utility function at all that rationalizes the data, there must 
exist a continuous, monotonic, and concave utility function that rationalizes the data. 
If the underlying utility function had the "wrong" curvature at some points, we would 
never observe choices being made at such points because they wouldn't satisfy the right 
second-order conditions. Hence market data do not allow us to reject the hypotheses of 
convexity and monotonicity of preferences. 

2.10 Recoverability 

Since the revealed preference conditions are a complete set of the restrictions imposed
by utility-maximizing behavior, they must contain all of the information available about
the underlying preferences. It is more or less obvious how to use the revealed preference
relations to determine the preferences among the observed choices, $x^t$, for $t = 1, \ldots, T$.
However, it is less obvious how to use the revealed preference relations to tell you about
preference relations between choices that have never been observed.

This is easiest to see using an example. Figure 2.19 depicts a single observation of choice
behavior, $(p^1, x^1)$. What does this choice imply about the indifference curve through a
bundle $x^0$? Note that $x^0$ has not been previously observed; in particular, we have no data
about the prices at which $x^0$ would be an optimal choice.

Let's try to use revealed preference to "bound" the indifference curve through x°. 
First, we observe that x 1 is revealed preferred to x°. Assume that preferences are convex 
and monotonic. Then all the bundles on the line segment connecting x° and x 1 must be 
at least as good as x°, and all the bundles that lie to the northeast of this bundle are at 
least as good as x°. Call this set of bundles i?P(x°), for "revealed preferred" to x°. It is 
not difficult to show that this is the best "inner bound" to the upper contour set through 
the point x°. 

To derive the best outer bound, we must consider all possible budget lines passing 
through x°. Let RW be the set of all bundles that are revealed worse than x° for all these 
budget lines. The bundles in RW are certain to be worse than x° no matter what budget 
line is used. 
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Figure 2.19: Inner and outer bounds. RP is the inner bound to the indifference curve 
through x°; the consumption of RW is the outer bound. 

The outer bound to the upper contour set at x° is then defined to be the complement 
of this set: NRW = all bundles not in RW. This is the best outer bound in the sense 
that any bundle not in this set cannot ever be revealed preferred to x° by a consistent
utility-maximizing consumer. Why? Because by construction, a bundle that is not in 
NRW(x°) must be in RW(x°) in which case it would be revealed worse than x°. 

In the case of a single observed choice, the bounds are not very tight. But with 
many choices, the bounds can become quite close together, effectively trapping the true 
indifference curve between them. See Figure 2.20 for an illustrative example. It is worth 
tracing through the construction of these bounds to make sure that you understand where 
they come from. Once we have constructed the inner and outer bounds for the upper 
contour sets, we have recovered essentially all the information about preferences that is
contained in the observed demand behavior. Hence, the construction of RP and RW is
analogous to solving the integrability equations. 

Our construction of RP and RW up until this point has been graphical. However, 
it is possible to generalize this analysis to multiple goods. It turns out that determining 
whether one bundle is revealed preferred or revealed worse than another involves checking 
to see whether a solution exists to a particular set of linear inequalities. 
Figure 2.20: Inner and outer bounds. When there are several observations, the inner
bound and outer bound can be quite tight.

2.11 Topics in Demand Behavior 

In this section we investigate several topics in demand behavior. Most of these have to 
do with special forms of the budget constraint or preferences that lead to special forms 
of demand behavior. There are many circumstances where such special cases are very 
convenient for analysis, and it is useful to understand how they work. 

2.11.1 Endowments in the Budget Constraint 

In our study of consumer behavior we have taken income to be exogenous. But in more 
elaborate models of consumer behavior it is necessary to consider how income is generated. 
The standard way to do this is to think of the consumer as having some endowment
ω = (ω_1, ..., ω_L) of various goods which can be sold at the current market prices p. This
gives the consumer income m = pω which can be used to purchase other goods.
The utility maximization problem becomes

$$\max_{x} \; u(x) \quad \text{such that } px = p\omega.$$

This can be solved by the standard techniques to find a demand function x(p, pω). The
net demand for good i is x_i − ω_i. The consumer may have positive or negative net
demands depending on whether he wants more or less of something than is available in
his endowment.

In this model prices influence the value of what the consumer has to sell as well as 
the value of what the consumer wants to sell. This shows up most clearly in Slutsky's 
equation, which we now derive. First, differentiate demand with respect to price: 



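$$\frac{dx_i(p, p\omega)}{dp_j} = \left.\frac{\partial x_i(p, p\omega)}{\partial p_j}\right|_{p\omega = \text{constant}} + \frac{\partial x_i(p, p\omega)}{\partial m}\,\omega_j.$$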



The first term in the right-hand side of this expression is the derivative of demand with 
respect to price, holding income fixed. The second term is the derivative of demand with 
respect to income, times the change in income. The first term can be expanded using 
Slutsky's equation. Collecting terms we have 

$$\frac{dx_i(p, p\omega)}{dp_j} = \frac{\partial h_i(p, u)}{\partial p_j} + \frac{\partial x_i(p, p\omega)}{\partial m}\,(\omega_j - x_j).$$

Now the income effect depends on the net demand for good j rather than the gross 
demand. 
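
The endowment version of Slutsky's equation can be checked numerically. The sketch below is only an illustration under an assumed specification that is not part of the notes: a two-good Cobb-Douglas consumer with u = x1^a x2^(1−a), for which the Marshallian and Hicksian demands have well-known closed forms. The function names and parameter values are hypothetical. It compares the total derivative of demand with respect to its own price (income moving with the value of the endowment) against the substitution effect plus the income effect weighted by the negative of net demand, (ω_1 − x_1).

```python
# Numerical check of the Slutsky equation with endowments.
# Assumption (not from the notes): Cobb-Douglas utility u = x1**a * x2**(1 - a).

def marshallian_x1(p1, p2, m, a):
    """Marshallian demand for good 1: x1 = a * m / p1."""
    return a * m / p1

def indirect_utility(p1, p2, m, a):
    """Utility reached at the Marshallian demands."""
    x1 = a * m / p1
    x2 = (1 - a) * m / p2
    return x1 ** a * x2 ** (1 - a)

def hicksian_x1(p1, p2, u, a):
    """Hicksian demand for good 1, using e(p, u) = u (p1/a)^a (p2/(1-a))^(1-a)."""
    e = u * (p1 / a) ** a * (p2 / (1 - a)) ** (1 - a)
    return a * e / p1

def slutsky_sides(p1, p2, w1, w2, a, h=1e-5):
    """Return (lhs, rhs) of the endowment Slutsky equation for i = j = 1."""
    def income(q1):
        # Income is the market value of the endowment (w1, w2).
        return q1 * w1 + p2 * w2

    # Left side: total derivative, income changes with the price of good 1.
    lhs = (marshallian_x1(p1 + h, p2, income(p1 + h), a)
           - marshallian_x1(p1 - h, p2, income(p1 - h), a)) / (2 * h)

    m = income(p1)
    u = indirect_utility(p1, p2, m, a)
    # Substitution effect: derivative of Hicksian demand at fixed utility.
    substitution = (hicksian_x1(p1 + h, p2, u, a)
                    - hicksian_x1(p1 - h, p2, u, a)) / (2 * h)
    # Income derivative of Marshallian demand at fixed prices.
    dx_dm = (marshallian_x1(p1, p2, m + h, a)
             - marshallian_x1(p1, p2, m - h, a)) / (2 * h)
    x1 = marshallian_x1(p1, p2, m, a)
    rhs = substitution + dx_dm * (w1 - x1)
    return lhs, rhs

lhs, rhs = slutsky_sides(p1=2.0, p2=1.0, w1=5.0, w2=10.0, a=0.3)
print("total derivative:", lhs)
print("substitution + endowment income effect:", rhs)
```

In this parameterization the consumer is a net seller of good 1 (x_1 = 3 while ω_1 = 5), so the endowment income effect is positive and partly offsets the negative substitution effect.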



2.11.2 Income-Leisure Choice Model 

Suppose that a consumer chooses two goods, consumption and "leisure". Let ℓ be the
number of hours worked and L̄ be the maximum number of hours that the consumer can work.
Leisure is then L = L̄ − ℓ. She also has some nonlabor income m. Let u(c, L) be the utility
of consumption and leisure and write the utility maximization problem as

$$\max_{c, L} \; u(c, L) \quad \text{such that } pc + wL = w\bar{L} + m.$$

This is essentially the same form that we have seen before. Here the consumer "sells" her 

endowment of labor at the price w and then buys some back as leisure. 

Slutsky's equation allows us to calculate how the demand for leisure changes as the 

wage rate changes. We have 

$$\frac{dL(p, w, m)}{dw} = \frac{\partial L(p, w, u)}{\partial w} + \frac{\partial L(p, w, m)}{\partial m}\,[\bar{L} - L].$$

Note that the term in brackets is nonnegative by definition, and almost surely positive
in practice. This means that, when leisure is a normal good, the derivative of leisure
demand is the sum of a negative number and a positive number and is inherently
ambiguous in sign. In other words, an
increase in the wage rate can lead to either an increase or a decrease in labor supply.
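
As a worked illustration, consider an assumed Cobb-Douglas specification, u(c, L) = c^a L^(1−a) with 0 < a < 1. This specification is not part of the original derivation and is used only to make the decomposition concrete. The consumer's full income is wL̄ + m, and the standard Cobb-Douglas demand for leisure gives the following terms.

```latex
% Assumed example: u(c, L) = c^{a} L^{1-a}, 0 < a < 1, full income w\bar{L} + m.
\begin{align*}
L(p, w, m) &= \frac{(1-a)(w\bar{L} + m)}{w}, \\[4pt]
\frac{dL(p, w, m)}{dw} &= -\frac{(1-a)\,m}{w^{2}}, \\[4pt]
\frac{\partial L(p, w, u)}{\partial w} &= -\frac{a(1-a)(w\bar{L} + m)}{w^{2}} \;<\; 0, \\[4pt]
\frac{\partial L(p, w, m)}{\partial m}\,[\bar{L} - L]
  &= \frac{1-a}{w}\left(a\bar{L} - \frac{(1-a)\,m}{w}\right).
\end{align*}
```

Adding the last two lines reproduces the second, so the decomposition holds term by term. In this particular example, leisure falls (labor supply rises) with the wage when m > 0, and when m = 0 the substitution and income effects cancel exactly, so labor supply does not respond to the wage at all. Other utility functions can produce a backward-bending labor supply.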

2.11.3 Homothetic Utility Functions 

A function f: R^n → R is homogeneous of degree 1 if f(tx) = t f(x) for all t > 0. A
function f(x) is homothetic if f(x) = g(h(x)) where g is a strictly increasing function
and h is a function which is homogeneous of degree 1.

Economists often find it useful to assume that utility functions are homogeneous or 
homothetic. In fact, there is little distinction between the two concepts in utility theory. 
A homothetic function is simply a monotonic transformation of a homogeneous function, 
but utility functions are only defined up to a monotonic transformation. Thus assuming 
that preferences can be represented by a homothetic function is equivalent to assuming 
that they can be represented by a function that is homogeneous of degree 1. If a consumer 
has preferences that can be represented by a homothetic utility function, economists say 
that the consumer has homothetic preferences.

It can easily be shown that if the utility function is homogeneous of degree 1, then the
expenditure function can be written as e(p, u) = e(p)u. This in turn implies that the
indirect utility function can be written as v(p, m) = v(p)m. Roy's identity then implies
that the demand functions take the form x_i(p, m) = x_i(p)m, i.e., they are linear functions
of income. The fact that the "income effects" take this special form is often useful in
demand analysis, as we will see below.
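
The linearity of demand in income can be illustrated numerically. The sketch below assumes a CES utility function (homogeneous of degree 1) and a simple ternary search along the budget line; both the functional form and the helper names are illustrative choices, not taken from the notes. It also applies a strictly increasing transformation to show that a homothetic representation of the same preferences yields the same demands.

```python
import math

def ces_utility(x1, x2, rho=0.5):
    """CES utility, homogeneous of degree 1 (assumed example)."""
    return (x1 ** rho + x2 ** rho) ** (1.0 / rho)

def transformed_utility(x1, x2):
    """A strictly increasing transformation of the CES utility: homothetic."""
    return math.log(1.0 + ces_utility(x1, x2))

def demand(utility, p1, p2, m, iterations=200):
    """Maximize utility on the budget line p1*x1 + p2*x2 = m.

    Uses ternary search over x1, which is valid when utility is
    single-peaked along the budget line (true for the CES case here).
    """
    def value(x1):
        x2 = max((m - p1 * x1) / p2, 0.0)  # guard against rounding below zero
        return utility(x1, x2)

    lo, hi = 0.0, m / p1
    for _ in range(iterations):
        left = lo + (hi - lo) / 3.0
        right = hi - (hi - lo) / 3.0
        if value(left) < value(right):
            lo = left
        else:
            hi = right
    x1 = 0.5 * (lo + hi)
    return x1, (m - p1 * x1) / p2

for income in (12.0, 24.0):
    print(income, demand(ces_utility, 1.0, 2.0, income))
print("transformed:", demand(transformed_utility, 1.0, 2.0, 12.0))
```

Doubling income from 12 to 24 should double both demands, and the transformed utility should give the same bundle as the untransformed one, up to the tolerance of the search.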

2.11.4 Aggregating Across Goods 

In many circumstances it is reasonable to model consumer choice by certain "partial" 
maximization problems. For example, we may want to model the consumer's choice of 
"meat" without distinguishing how much is beef, pork, lamb, etc. In most empirical work, 
some kind of aggregation of this sort is necessary. 

In order to describe some useful results concerning this kind of separability of consump- 
tion decisions, we will have to introduce some new notation. Let us think of partitioning 
the consumption bundle into two "subbundles" so that the consumption bundle takes the 
form (x, z). For example, x could be the vector of consumptions of different kinds of 
meat, and z could be the vector of consumption of all other goods.

We partition the price vector analogously into (p, q). Here p is the price vector for the 
different kinds of meat, and q is the price vector for the other goods. With this notation 
the standard utility maximization problem can be written as 

$$\max_{x, z} \; u(x, z) \quad \text{such that } px + qz = m. \tag{2.24}$$

The problem of interest is under what conditions we can study the demand problem for 
the x-goods, say, as a group, without worrying about how demand is divided among the 
various components of the x-goods. 

One way to formulate this problem mathematically is as follows. We would like to be 
able to construct some scalar quantity index, X, and some scalar price index, P, that 
are functions of the vector of quantities and the vector of prices: 

$$P = f(p), \qquad X = g(x). \tag{2.25}$$

In this expression P is supposed to be some kind of "price index" which gives the "average 
price" of the goods, while X is supposed to be a quantity index that gives the average 
"amount" of meat consumed. Our hope is that we can find a way to construct these price 
and quantity indices so that they behave like ordinary prices and quantities. 

That is, we hope to find a new utility function U(X,z), which depends only on the 
quantity index of x-consumption, that will give us the same answer as if we solved the 
entire maximization problem in (2.24). More formally, consider the problem 

$$\max_{X, z} \; U(X, z) \quad \text{such that } PX + qz = m.$$

The demand function for the quantity index X will be some function X(P, q, m). We 
want to know when it will be the case that 

X(P,q,m) = X(f(p),q,m) = g(x(p,q,m)). 

This requires that we get to the same value of X via two different routes: 
1) first aggregate prices using P = f(p) and then maximize U(X, z) subject to the
budget constraint PX + qz = m.

2) first maximize u(x, z) subject to px + qz = m and then aggregate quantities to get 
X = g(x). 

There are two situations under which this kind of aggregation is possible. The first 
situation, which imposes constraints on the price movements, is known as Hicksian 
separability. The second, which imposes constraints on the structure of preferences, is 
known as functional separability. 

Hicksian separability 

Suppose that the price vector p is always proportional to some fixed base price vector p° 
so that p = tp° for some scalar t. If the x-goods are various kinds of meat, this condition 
requires that the relative prices of the various kinds of meat remain constant — they all 
increase and decrease in the same proportion. 

Following the general framework described above, let us define the price and quantity 
indices for the x-goods by 

P = t 
X = p°x. 

We define the indirect utility function associated with these indices as 

$$V(P, q, m) = \max_{x, z} \; u(x, z) \quad \text{such that } P p^0 x + qz = m.$$

It is straightforward to check that this indirect utility function has all the usual properties: 
it is quasiconvex, homogeneous in price and income, etc. In particular, a straightforward 
application of the envelope theorem shows that we can recover the demand function for 
the x-good by Roy's identity: 

$$X(P, q, m) = -\frac{\partial V(P, q, m)/\partial P}{\partial V(P, q, m)/\partial m} = p^0 x(p, q, m).$$

This calculation shows that X(P, q, m) is an appropriate quantity index for the x-goods
consumption: we get the same result if we first aggregate prices and then maximize 
U(X,z) as we get if we maximize u(x,z) and then aggregate quantities. 
We can solve for the direct utility function that is dual to V(P, q, m) by the usual
calculation:

$$U(X, z) = \min_{P, q} \; V(P, q, m) \quad \text{such that } PX + qz = m.$$

By construction this direct utility function has the property that 

$$V(P, q, m) = \max_{X, z} \; U(X, z) \quad \text{such that } PX + qz = m.$$

Hence, the price and quantity indices constructed this way behave just like ordinary prices 
and quantities. 

The two-good model 

One common application of Hicksian aggregation is when we are studying the demand 
for a single good. In this case, think of the z-goods as being a single good, z, and the 
x-goods as "all other goods." The actual maximization problem is then 

$$\max_{x, z} \; u(x, z) \quad \text{such that } px + qz = m.$$

Suppose that the relative prices of the x-goods remain constant, so that p = Pp°. That
is, the vector of prices p is some base price vector p° times some price index P. Then 
Hicksian aggregation says that we can write the demand function for the z-good as 

z = z(P, q, m). 

Since this demand function is homogeneous of degree zero, with some abuse of notation, 
we can also write 

z = z(q/P, m/P). 

This says that the demand for the z-good depends on the relative price of the z-good to 
"all other goods" and income, divided by the price of "all other goods." In practice, the 
price index for all other goods is usually taken to be some standard consumer price index. 
The demand for the z-good becomes a function of only two variables: the price of the
z-good relative to the CPI and income relative to the CPI.
Functional separability

The second case in which we can decompose the consumer's consumption decision is 
known as the case of functional separability. Let us suppose that the underlying preference 
ordering has the property that 

$$(x, z) \succ (x', z) \quad \text{if and only if} \quad (x, z') \succ (x', z')$$

for all consumption bundles x, x', z and z'. This condition says that if x is preferred to x' 
for some choices of the other goods, then x is preferred to x' for all choices of the other 
goods. Or, even more succinctly, the preferences over the x-goods are independent of the 
z-goods. 

If this "independence" property is satisfied and the preferences are locally nonsatiated,
then it can be shown that the utility function for x and z can be written in the form
u(x, z) = U(v(x), z), where U(v, z) is an increasing function of v. That is, the overall
utility from x and z can be written as a function of the subutility of x, v(x), and the
level of consumption of the z-goods.

If the utility function can be written in this form, we will say that the utility function 
is weakly separable. What does separability imply about the structure of the utility 
maximization problem? As usual, we will write the demand function for the goods as
x(p, q, m) and z(p, q, m). Let m_x = px(p, q, m) be the optimal expenditure on the
x-goods.

It turns out that if the overall utility function is weakly separable, the optimal choice 
of the x-goods can be found by solving the following subutility maximization problem: 

$$\max_{x} \; v(x) \quad \text{such that } px = m_x. \tag{2.26}$$

This means that if we know the expenditure on the x-goods, m_x = px(p, q, m), we can
solve the subutility maximization problem to determine the optimal choice of the x-goods.
In other words, the demand for the x-goods is only a function of the prices of the x-goods
and the expenditure on the x-goods, m_x. The prices of the other goods are only relevant
insofar as they determine the expenditure on the x-goods.

The proof of this is straightforward. Assume that x(p, q, m) does not solve the above
problem. Instead, let x' be another value of x that satisfies the budget constraint and
yields strictly greater subutility. Then the bundle (x', z(p, q, m)) would give higher overall utility
than (x(p, q, m), z(p, q, m)), which contradicts the definition of the demand function.

The demand functions x(p, m_x) are sometimes known as conditional demand functions
since they give demand for the x-goods conditional on the level of expenditure
on these goods. Thus, for example, we may consider the demand for beef as a function 
of the prices of beef, pork, and lamb and the total expenditure on meat. Let e(p, v) be 
the expenditure function for the subutility maximization problem given in (2.26). This 
tells us how much expenditure on the x-goods is necessary at prices p to achieve the 
subutility v. It is not hard to see that we can write the overall maximization problem of 
the consumer as 

max U (v, z) 

v,z 

such that e(p,v ) + qz = m 

This is almost in the form we want: v is a suitable quantity index for the x-goods, but
the price index for the x-goods isn't quite right. We want P times X, but we have some
nonlinear function of p and X = v.

In order to have a budget constraint that is linear in the quantity index, we need to assume
that the subutility function has a special structure. For example, suppose that the subutility
function is homothetic. Then we can write e(p, v) as e(p)v. Hence, we can choose our
quantity index to be X = v(x), our price index to be P = e(p), and our utility function
to be U(X, z). We get the same X if we solve

$$\max_{X, z} \; U(X, z) \quad \text{such that } PX + qz = m$$

as if we solve

$$\max_{x, z} \; U(v(x), z) \quad \text{such that } px + qz = m,$$

and then aggregate using X = v(x).

In this formulation we can think of the consumption decision as taking place in two 
stages: first the consumer considers how much of the composite commodity (e.g., meat)
to consume as a function of a price index of meat by solving the overall maximization
problem; then the consumer considers how much beef to consume given the prices of the 
various sorts of meat and the total expenditure on meat, which is the solution to the 
subutility maximization problem. Such a two-stage budgeting process is very convenient 
in applied demand analysis. 
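
Two-stage budgeting can be sketched in a few lines under an assumed nested Cobb-Douglas specification, which is a hypothetical example and not one given in the notes: the subutility is v(x) = x1^a x2^(1−a), which is homogeneous of degree 1, and the overall utility is U(v, z) = v^b z^(1−b). With this choice the price index is the unit expenditure function e(p) = (p1/a)^a (p2/(1−a))^(1−a), and all demands have closed forms, so the two routes can be compared directly.

```python
# Two-stage budgeting with a nested Cobb-Douglas utility (assumed example):
#   subutility  v(x1, x2) = x1**a * x2**(1 - a)
#   overall     U(v, z)   = v**b * z**(1 - b)

def price_index(p1, p2, a):
    """Unit expenditure function e(p) of the subutility; plays the role of P."""
    return (p1 / a) ** a * (p2 / (1 - a)) ** (1 - a)

def subutility(x1, x2, a):
    """Quantity index X = v(x)."""
    return x1 ** a * x2 ** (1 - a)

def stage_one(P, q, m, b):
    """Choose the composite X and the other good z given the price index P."""
    return b * m / P, (1 - b) * m / q

def stage_two(p1, p2, m_x, a):
    """Allocate group expenditure m_x across the x-goods (conditional demand)."""
    return a * m_x / p1, (1 - a) * m_x / p2

def two_stage_demand(p1, p2, q, m, a, b):
    P = price_index(p1, p2, a)
    X, z = stage_one(P, q, m, b)
    x1, x2 = stage_two(p1, p2, P * X, a)  # group expenditure is m_x = P * X
    return x1, x2, z, X

def direct_demand(p1, p2, q, m, a, b):
    """One-shot solution: U(v(x), z) is Cobb-Douglas in (x1, x2, z)."""
    return a * b * m / p1, (1 - a) * b * m / p2, (1 - b) * m / q

x1, x2, z, X = two_stage_demand(2.0, 4.0, 1.0, 100.0, a=0.25, b=0.4)
print("two-stage:", (x1, x2, z))
print("direct:   ", direct_demand(2.0, 4.0, 1.0, 100.0, a=0.25, b=0.4))
print("X and v(x):", X, subutility(x1, x2, 0.25))
```

The two routes should agree, and the stage-one quantity index X should equal the subutility evaluated at the stage-two bundle, which is the consistency requirement X(P, q, m) = g(x(p, q, m)) discussed above.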

2.11.5 Aggregating Across Consumers 

We have studied the properties of a consumer's demand function, x(p, m). Now let us
consider some collection of i = 1, ..., n consumers, each of whom has a demand function
for some L commodities, so that consumer i's demand function is a vector x_i(p, m_i) =
(x_i^1(p, m_i), ..., x_i^L(p, m_i)) for i = 1, ..., n. Note that we have changed our notation slightly:
goods are now indicated by superscripts while consumers are indicated by subscripts.
The aggregate demand function is defined by X(p, m_1, ..., m_n) = Σ_{i=1}^n x_i(p, m_i). The
aggregate demand for good l is denoted by X^l(p, m) where m denotes the vector of incomes
(m_1, ..., m_n).

The aggregate demand function inherits certain properties of the individual demand 
functions. For example, if the individual demand functions are continuous, the aggregate 
demand function will certainly be continuous. Continuity of the individual demand func- 
tions is a sufficient but not necessary condition for continuity of the aggregate demand 
functions. 

What other properties does the aggregate demand function inherit from the individual 
demands? Is there an aggregate version of Slutsky's equation or of the Strong Axiom of 
Revealed Preference? Unfortunately, the answer to these questions is no. In fact the 
aggregate demand function will in general possess no interesting properties other than 
homogeneity and continuity. Hence, the theory of the consumer places no restrictions 
on aggregate behavior in general. However, in certain cases it may happen that the 
aggregate behavior may look as though it were generated by a single "representative" 
consumer. Below, we consider a circumstance where this may happen. 

Suppose that all individual consumers' indirect utility functions take the Gorman 
form: 

$$v_i(p, m_i) = a_i(p) + b(p)\,m_i.$$

Note that the a_i(p) term can differ from consumer to consumer, but the b(p) term
is assumed to be identical for all consumers. By Roy's identity the demand function for
good j of consumer i will then take the form

$$x_i^j(p, m_i) = \alpha_i^j(p) + \beta^j(p)\,m_i, \tag{2.27}$$

where

$$\alpha_i^j(p) = -\frac{\partial a_i(p)/\partial p_j}{b(p)}, \qquad \beta^j(p) = -\frac{\partial b(p)/\partial p_j}{b(p)}.$$

Note that the marginal propensity to consume good j, ∂x_i^j(p, m_i)/∂m_i, is independent
of the level of income of any consumer and also constant across consumers since b(p) is
constant across consumers. The aggregate demand for good j will then take the form

$$X^j(p, m_1, \ldots, m_n) = -\left[\sum_{i=1}^{n} \frac{\partial a_i(p)/\partial p_j}{b(p)} + \frac{\partial b(p)/\partial p_j}{b(p)} \sum_{i=1}^{n} m_i\right].$$



This demand function can in fact be generated by a representative consumer. His
representative indirect utility function is given by

$$V(p, M) = \sum_{i=1}^{n} a_i(p) + b(p)\,M = A(p) + B(p)\,M,$$

where M = Σ_{i=1}^n m_i.

The proof is simply to apply Roy's identity to this indirect utility function and to note
that it yields the aggregate demand function obtained by summing the individual demands
in equation (2.27). In fact it can be shown that
the Gorman form is the most general form of the indirect utility function that will allow 
for aggregation in the sense of the representative consumer model. Hence, the Gorman 
form is not only sufficient for the representative consumer model to hold, but it is also 
necessary. 
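
The aggregation property can be illustrated with a small numerical sketch. It assumes the linear expenditure system (Stone-Geary preferences), a standard example whose indirect utility has the Gorman form with a common b(p). The notes do not use this example, and the function names and numbers below are hypothetical. Demands are x_i^j = γ_i^j + (β^j/p_j)(m_i − pγ_i), with the β^j identical across consumers and summing to one.

```python
# Aggregation under the Gorman form, illustrated with the linear
# expenditure system (an assumed example, not taken from the notes).

def les_demand(p, m, gamma, beta):
    """Demand of one consumer with subsistence bundle gamma and shares beta."""
    supernumerary = m - sum(pk * gk for pk, gk in zip(p, gamma))
    return [g + b * supernumerary / pj for pj, g, b in zip(p, gamma, beta)]

def aggregate_demand(p, incomes, gammas, beta):
    """Sum individual demands; consumers differ in income and in gamma."""
    total = [0.0] * len(p)
    for m, gamma in zip(incomes, gammas):
        for j, xj in enumerate(les_demand(p, m, gamma, beta)):
            total[j] += xj
    return total

prices = [1.0, 2.0]
beta = [0.4, 0.6]                   # common to all consumers
gammas = [[1.0, 0.5], [2.0, 1.0]]   # consumer-specific terms

# Two income distributions with the same total income M = 40.
agg_a = aggregate_demand(prices, [10.0, 30.0], gammas, beta)
agg_b = aggregate_demand(prices, [25.0, 15.0], gammas, beta)

# Representative consumer: summed gammas and total income.
gamma_rep = [sum(g[j] for g in gammas) for j in range(len(prices))]
rep = les_demand(prices, 40.0, gamma_rep, beta)

print(agg_a, agg_b, rep)
```

Because every consumer has the same marginal propensity to consume each good, redistributing income leaves aggregate demand unchanged, and the aggregate coincides with the demand of a single consumer holding total income M.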

Although a complete proof of this fact is rather detailed, the following argument 
is reasonably convincing. Suppose, for the sake of simplicity, that there are only two 
consumers. Then by hypothesis the aggregate demand for good j can be written as 

$$X^j(p, m_1 + m_2) = x_1^j(p, m_1) + x_2^j(p, m_2).$$

If we first differentiate with respect to m_1 and then with respect to m_2, we find the
following identities

$$\frac{\partial X^j(p, M)}{\partial M} = \frac{\partial x_1^j(p, m_1)}{\partial m_1} = \frac{\partial x_2^j(p, m_2)}{\partial m_2}.$$

Hence, the marginal propensity to consume good j must be the same for all consumers.
If we differentiate this expression once more with respect to m_1, we find that

$$\frac{\partial^2 X^j(p, M)}{\partial M^2} = \frac{\partial^2 x_1^j(p, m_1)}{\partial m_1^2} = 0,$$

where the final equality holds because ∂x_2^j(p, m_2)/∂m_2 does not depend on m_1.
Thus, consumer 1's demand for good j (and, therefore, consumer 2's demand)
is affine in income. Hence, the demand functions for good j take the form x_i^j(p, m_i) =
α_i^j(p) + β^j(p)m_i. If this is true for all goods, the indirect utility function for each consumer
must have the Gorman form.

One special case of a utility function having the Gorman form is a utility function that 
is homothetic. In this case the indirect utility function has the form v(p,m) = v(p)m, 
which is clearly of the Gorman form. Another special case is that of a quasi-linear utility 
function. A utility function U is said to be quasi-linear if it has the following functional 
form: 

$$U(x_0, x_1, \ldots, x_L) = x_0 + u(x_1, \ldots, x_L).$$

In this case v(p,m) = v(p) + m, which obviously has the Gorman form. Many of the 
properties possessed by homothetic and/or quasi-linear utility functions are also possessed 
by the Gorman form. 

Quasi-linear utility functions form a very important class. They play a central role
in many fields of economics, such as information economics, mechanism design theory,
and property rights theory, owing to their key property that there is no income effect:
demand for the goods x_1, ..., x_L does not respond to changes in income.
Chapter 3

Production Theory

3.1 Introduction 

Economic activity involves not only consumption but also production and trade. Production
should be interpreted very broadly, however, to include production of both physical
goods (such as rice or automobiles) and services (such as medical care or financial services).

A firm can be characterized by many factors and aspects, such as its sector, production
scale, ownership, and organizational structure. But which features are most important for
studying a producer's behavior in making choices? To capture the most important features
of producer behavior and choices, modern producer theory assumes
that the key characteristic of a firm is its production set. The producer's characteristic
and the behavior assumption are the building blocks of any model of producer theory. The
production set represents the set of all technologically feasible production plans. The
behavior assumption expresses the guiding principle the producer uses to make choices.
It is generally assumed that the producer seeks to identify and select the production plan that
is most profitable.

We will first present a general framework of production technology. By itself, the
framework does not describe how production choices are made. It only specifies the basic
characteristic of a firm, which defines what choices can be made; it does not specify what
choices should be made. We will then discuss what choices should be made based on
the behavior assumptions on firms. A basic behavior assumption on producers is profit
maximization. After that, we will describe production possibilities in physical terms,
which are then recast in economic terms using cost functions.

3.2 Production Technology 

Production is the process of transforming inputs into outputs. Typically, inputs consist
of labor, capital equipment, raw materials, and intermediate goods purchased from other
firms. Outputs consist of finished products or services, or intermediate goods to be sold to
other firms. Often alternative methods are available for producing the same output, using 
different combinations of inputs. A firm produces outputs from various combinations 
of inputs. In order to study firm choices we need a convenient way to summarize the 
production possibilities of the firm, i.e., which combinations of inputs and outputs are 
technologically feasible. 

3.2.1 Measurement of Inputs and Outputs 

It is usually most satisfactory to think of the inputs and outputs as being measured
in terms of flows: a certain amount of inputs per time period is used to produce a certain
amount of outputs per time period at some location. It is a good idea to explicitly
include the time and location dimensions in a specification of inputs and outputs. The 
level of detail that we will use in specifying inputs and outputs will depend on the problem 
at hand, but we should remain aware of the fact that a particular input or output good 
can be specified in arbitrarily fine detail. However, when discussing technological choices 
in the abstract, as we do in this chapter, it is common to omit the time and location 
dimensions. 

3.2.2 Specification of Technology 

The fundamental reality firms must contend with in this process is technological feasibility. 
The state of technology determines and restricts what is possible in combining inputs to
produce outputs, and there are several ways we can represent this constraint. The most
general way is to think of the firm as having a production possibility set. 

Suppose the firm has L possible goods to serve as inputs and/or outputs. If a firm
uses y_j^i units of a good j as an input and produces y_j^o of the good as an output, then the
net output of good j is given by y_j = y_j^o − y_j^i.

A production plan is simply a list of net outputs of various goods. We can represent 
a production plan by a vector y in R^L where y_j is negative if the jth good serves as a net
input and positive if the jth good serves as a net output. The set of all technologically
feasible production plans is called the firm's production possibilities set and will be
denoted by Y, a subset of R^L. The set Y is supposed to describe all patterns of inputs
and outputs that are technologically feasible. It gives us a complete description of the 
technological possibilities facing the firm. 

When we study the behavior of a firm in certain economic environments, we may want 
to distinguish between production plans that are "immediately feasible" and those that 
are "eventually" feasible. We will generally assume that such restrictions can be described 
by some vector z in R^L. The restricted or short-run production possibilities set
will be denoted by Y(z); this consists of all feasible net output bundles consistent with 
the constraint level z. The following are some examples of such restrictions. 

EXAMPLE: Input requirement set 

Suppose a firm produces only one output. In this case we write the net output bundle 
as (y, — x) where x is a vector of inputs that can produce y units of output. We can then 
define a special case of a restricted production possibilities set, the input requirement 
set: 

$$V(y) = \{x \in \mathbb{R}^n_+ : (y, -x) \in Y\}$$

The input requirement set is the set of all input bundles that produce at least y units of 
output. 

Note that the input requirement set, as defined here, measures inputs as positive 
numbers rather than negative numbers as used in the production possibilities set. 

EXAMPLE: Isoquant 

In the case above we can also define an isoquant: 

$$Q(y) = \{x \in \mathbb{R}^n_+ : x \in V(y) \text{ and } x \notin V(y') \text{ for } y' > y\}.$$

The isoquant gives all input bundles that produce exactly y units of output. 
Figure 3.1: Convex input requirement sets.

Figure 3.2: Panel A depicts the general shape of an isoquant curve, and panel B depicts
the general shape of a perfect complement (Leontief) technology.



EXAMPLE: Short-run production possibilities set 

Suppose a firm produces some output from labor and some kind of machine which we will
refer to as "capital." Production plans then look like (y, −l, −k) where y is the level of
output, l the amount of labor input, and k the amount of capital input. We imagine that
labor can be varied immediately but that capital is fixed at the level k̄ in the short run.
Then

$$Y(\bar{k}) = \{(y, -l, -k) \in Y : k = \bar{k}\}$$

is a short-run production possibilities set.

EXAMPLE: PRODUCTION FUNCTION 

If the firm has only one output, we can define the production function: f(x) = {y in
R: y is the maximum output associated with −x in Y}.

EXAMPLE: TRANSFORMATION FUNCTION 
A production plan y in Y is (technologically) efficient if there is no y' in Y such that
y' ≥ y and y' ≠ y; that is, a production plan is efficient if there is no way to produce
more output with the same inputs or to produce the same output with fewer inputs. (Note
carefully how the sign convention on inputs works here.) We often assume that we can
describe the set of technologically efficient production plans by a transformation function
T: R^L → R where T(y) = 0 if and only if y is efficient. Just as a production function
picks out the maximum scalar output as a function of the inputs, the transformation
function picks out the maximal vectors of net outputs.

EXAMPLE: COBB-DOUGLAS TECHNOLOGY 

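Let a be a parameter such that 0 < a < 1. Then the Cobb-Douglas technology is
defined in the following manner.

$$\begin{aligned}
Y &= \{(y, -x_1, -x_2) \in \mathbb{R}^3 : y \le x_1^{a} x_2^{1-a}\} \\
V(y) &= \{(x_1, x_2) \in \mathbb{R}^2_+ : y \le x_1^{a} x_2^{1-a}\} \\
Q(y) &= \{(x_1, x_2) \in \mathbb{R}^2_+ : y = x_1^{a} x_2^{1-a}\} \\
Y(z) &= \{(y, -x_1, -x_2) \in \mathbb{R}^3 : y \le x_1^{a} x_2^{1-a},\; x_2 = z\} \\
T(y, x_1, x_2) &= y - x_1^{a} x_2^{1-a} \\
f(x_1, x_2) &= x_1^{a} x_2^{1-a}.
\end{aligned}$$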

EXAMPLE: Leontief technology 

Let a > 0 and b > 0 be parameters. Then the Leontief technology is defined in the
following manner.

$$\begin{aligned}
Y &= \{(y, -x_1, -x_2) \in \mathbb{R}^3 : y \le \min(ax_1, bx_2)\} \\
V(y) &= \{(x_1, x_2) \in \mathbb{R}^2_+ : y \le \min(ax_1, bx_2)\} \\
Q(y) &= \{(x_1, x_2) \in \mathbb{R}^2_+ : y = \min(ax_1, bx_2)\} \\
T(y, x_1, x_2) &= y - \min(ax_1, bx_2) \\
f(x_1, x_2) &= \min(ax_1, bx_2).
\end{aligned}$$
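
The different representations of a technology listed in the two examples above can be expressed as simple membership tests built on the production function. The sketch below is illustrative only: the function names, default parameter values, and tolerance are assumptions made for the example.

```python
# Representations of a single-output, two-input technology built from
# its production function f. All names and defaults are illustrative.

def cobb_douglas(x1, x2, a=0.5):
    """f(x1, x2) = x1**a * x2**(1 - a), with 0 < a < 1."""
    return x1 ** a * x2 ** (1 - a)

def leontief(x1, x2, a=1.0, b=1.0):
    """f(x1, x2) = min(a*x1, b*x2), with a > 0 and b > 0."""
    return min(a * x1, b * x2)

def in_input_requirement_set(f, y, x1, x2, tol=1e-12):
    """x is in V(y): nonnegative inputs that produce at least y."""
    return x1 >= 0 and x2 >= 0 and y <= f(x1, x2) + tol

def on_isoquant(f, y, x1, x2, tol=1e-9):
    """x is in Q(y): inputs that produce exactly y (up to tol)."""
    return x1 >= 0 and x2 >= 0 and abs(f(x1, x2) - y) <= tol

def in_production_set(f, plan, tol=1e-12):
    """plan = (y, -x1, -x2) is in Y; inputs enter as negative net outputs."""
    y, neg_x1, neg_x2 = plan
    return in_input_requirement_set(f, y, -neg_x1, -neg_x2, tol)

def transformation(f, y, x1, x2):
    """T(y, x1, x2) = y - f(x1, x2); zero exactly on the efficient plans."""
    return y - f(x1, x2)

print(cobb_douglas(4.0, 9.0))                             # output from (4, 9)
print(in_input_requirement_set(cobb_douglas, 5.0, 4.0, 9.0))
print(on_isoquant(leontief, 2.0, 2.0, 3.0))
print(in_production_set(cobb_douglas, (7.0, -4.0, -9.0)))
print(transformation(leontief, 2.0, 2.0, 3.0))
```

Note the sign convention: the production set takes inputs as negative numbers, while the input requirement set and the isoquant take them as positive numbers.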
3.2.3 Common Properties of Production Sets

Although the production possibility sets of different processes can differ widely in struc- 
ture, many technologies share certain general properties. If it can be assumed that these 
properties are satisfied, special theoretical results can be derived. Some important prop- 
erties are defined below: 

POSSIBILITY OF INACTION: 0 ∈ Y.

Possibility of inaction means that no action on production is a possible production 
plan. 

CLOSEDNESS: Y is closed.

Closedness of the possibility set Y means that, whenever a sequence of production plans
y_i, i = 1, 2, ..., is in Y and y_i → y, the limit production plan y is also in Y. It
guarantees that points on the boundary of Y are feasible. Note that closedness of Y implies
that the input requirement set V(y) is a closed set for all y ≥ 0.

FREE DISPOSAL OR MONOTONICITY: If y ∈ Y implies that y' ∈ Y for all
y' ≤ y, then the set Y is said to satisfy the free disposal or monotonicity property.

Free disposal implies that commodities (either inputs or outputs) can be thrown away.
This property means that if y ∈ Y, then Y includes all vectors in the negative orthant
translated to y, i.e., every plan obtained by adding to y a vector that has only inputs
and no outputs.

A weaker requirement is that we only assume that the input requirement set is monotonic:
If x is in V(y) and x' ≥ x, then x' is in V(y). Monotonicity of V(y) means that, if x
is a feasible way to produce y units of output and x' is an input vector with at least as
much of each input, then x' should be a feasible way to produce y.

IRREVERSIBILITY: Y ∩ (−Y) = {0}.

Irreversibility means a production plan is not reversible unless it is a non-action plan. 

CONVEXITY: Y is convex if whenever y and y' are in Y, the weighted average ty +
(1 − t)y' is also in Y for any t with 0 ≤ t ≤ 1.

Convexity of Y means that, if all goods are divisible, it is often reasonable to assume
that two production plans y and y' can be scaled downward and combined. However,
it should be noted that the convexity of the production set is a strong hypothesis. For
example, convexity of the production set rules out "start up costs" and other sorts of
returns to scale. This will be discussed in greater detail shortly.

STRICT CONVEXITY: $Y$ is strictly convex if $y \in Y$ and $y' \in Y$ with $y \neq y'$ imply that
$ty + (1 - t)y' \in \operatorname{int} Y$ for all $0 < t < 1$, where $\operatorname{int} Y$ denotes the interior points of $Y$.

As we will show, the strict convexity of Y can guarantee the profit maximizing pro- 
duction plan is unique provided it exists. 

A weaker and more reasonable requirement is to assume that $V(y)$ is a convex set for
all outputs $y \geq 0$:

CONVEXITY OF INPUT REQUIREMENT SET: If $x$ and $x'$ are in $V(y)$, then
$tx + (1 - t)x'$ is in $V(y)$ for all $0 \leq t \leq 1$. That is, $V(y)$ is a convex set.

Convexity of V(y) means that, if x and x' both can produce y units of output, then 
any weighted average tx + (1 — t)x' can also produce y units of output. We describe a 
few of the relationships between the convexity of V(y), the curvature of the production 
function, and the convexity of Y. We first have 

Proposition 3.2.1 (Convex Production Set Implies Convex Input Requirement Set) 

If the production set Y is a convex set, then the associated input requirement set, V(y), 
is a convex set. 

Proof. If $Y$ is a convex set, then for any $x$ and $x'$ such that $(y, -x)$ and
$(y, -x')$ are in $Y$, and for any $0 \leq t \leq 1$, we must have $(ty + (1 - t)y, -tx - (1 - t)x')$ in $Y$. This
is simply requiring that $(y, -(tx + (1 - t)x'))$ is in $Y$. It follows that if $x$ and $x'$ are in
$V(y)$, then $tx + (1 - t)x'$ is in $V(y)$, which shows that $V(y)$ is convex.
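
The convexity of the input requirement set can also be checked numerically for a specific technology. The sketch below is only an illustration under stated assumptions: it assumes a Cobb-Douglas production function with exponent a = 0.3 (a hypothetical choice, not taken from the text), draws random pairs of input bundles in V(y), and tests whether their convex combinations remain in V(y). A passing check is evidence for convexity, not a proof.

```python
import random


def f(x1, x2, a=0.3):
    """Assumed Cobb-Douglas production function x1^a * x2^(1-a)."""
    return x1 ** a * x2 ** (1 - a)


def in_V(x, y, tol=0.0):
    """True if the input bundle x produces at least y (up to tol), i.e. x is in V(y)."""
    return f(x[0], x[1]) >= y - tol


def mixtures_stay_in_V(y, trials=10_000, seed=0):
    """Sample pairs in V(y) and check that their convex combinations stay in V(y)."""
    rng = random.Random(seed)
    for _ in range(trials):
        x = (rng.uniform(0.0, 10.0), rng.uniform(0.0, 10.0))
        xp = (rng.uniform(0.0, 10.0), rng.uniform(0.0, 10.0))
        if not (in_V(x, y) and in_V(xp, y)):
            continue  # only pairs that can both produce y are relevant
        t = rng.random()
        mix = (t * x[0] + (1 - t) * xp[0], t * x[1] + (1 - t) * xp[1])
        if not in_V(mix, y, tol=1e-9):  # small tolerance for rounding error
            return False
    return True


print(mixtures_stay_in_V(2.0))
```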

Proposition 3.2.2 $V(y)$ is a convex set if and only if the production function $f(x)$ is a
quasi-concave function.

Proof. $V(y) = \{x : f(x) \geq y\}$, which is just the upper contour set of $f(x)$. But a function
is quasi-concave if and only if it has a convex upper contour set.

3.2.4 Returns to Scale

Suppose that we are using some vector of inputs $x$ to produce some output $y$ and we decide
to scale all inputs up or down by some amount $t \geq 0$. What will happen to the level of
output? The notions of returns to scale can be used to answer this question. Returns to
scale refer to how output responds when all inputs are varied in the same proportion, so
they describe long-run production processes. There are three possibilities: technology
exhibits (1) constant returns to scale; (2) decreasing returns to scale, and (3)
increasing returns to scale. Formally, we have

(GLOBAL) RETURNS TO SCALE. A production function $f(x)$ is said to exhibit:

(1) constant returns to scale if $f(tx) = t f(x)$ for all $t \geq 0$;

(2) decreasing returns to scale if $f(tx) < t f(x)$ for all $t > 1$;

(3) increasing returns to scale if $f(tx) > t f(x)$ for all $t > 1$.

Constant returns to scale (CRS) means that doubling inputs exactly doubles output,
which is often a reasonable assumption to make about technologies. Decreasing returns
to scale means that doubling inputs less than doubles output. Increasing returns to
scale means that doubling inputs more than doubles output.

Note that a technology has constant returns to scale if and only if its production
function is homogeneous of degree 1. Constant returns to scale is also equivalent to: the
statement $y$ in $Y$ implies $ty$ is in $Y$ for all $t \geq 0$; or equivalent to the statement $x$ in $V(y)$
implies $tx$ is in $V(ty)$ for all $t \geq 0$.
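
The global definitions above can be probed numerically by comparing f(tx) with t f(x). The following is a minimal sketch, assuming a two-input Cobb-Douglas family x1^a x2^b as the test technology; the helper names `returns_to_scale` and `cobb_douglas` are hypothetical. Because it checks only a few scale factors at a single input vector, it gives evidence about returns to scale rather than a proof.

```python
def returns_to_scale(f, x, ts=(1.5, 2.0, 4.0), tol=1e-9):
    """Compare f(t*x) with t*f(x) for several t > 1 at one input vector x."""
    fx = f(x)
    gaps = [f([t * xi for xi in x]) - t * fx for t in ts]
    if all(abs(g) <= tol for g in gaps):
        return "constant"
    if all(g < -tol for g in gaps):
        return "decreasing"
    if all(g > tol for g in gaps):
        return "increasing"
    return "mixed"


def cobb_douglas(a, b):
    """Assumed test technology f(x) = x1^a * x2^b."""
    return lambda x: x[0] ** a * x[1] ** b


x0 = [4.0, 9.0]
for a, b in [(0.5, 0.5), (0.3, 0.5), (1.0, 0.5)]:
    print(a + b, returns_to_scale(cobb_douglas(a, b), x0))
```

For this family the classification depends only on a + b, which is the degree of homogeneity of the function.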

It may be remarked that the various kinds of returns to scale defined above are global 
in nature. It may well happen that a technology exhibits increasing returns to scale 
for some values of x and decreasing returns to scale for other values. Thus in many 
circumstances a local measure of returns to scale is useful. To define local returns to
scale, we first define the elasticity of scale.

The elasticity of scale measures the percent increase in output due to a one percent
increase in all inputs, that is, due to an increase in the scale of operations.

Let $y = f(x)$ be the production function. Let $t$ be a positive scalar, and consider the
function $y(t) = f(tx)$. If $t = 1$, we have the current scale of operation; if $t > 1$, we are
scaling all inputs up by $t$; and if $t < 1$, we are scaling all inputs down by $t$. The elasticity
of scale is given by

$$e(x) = \frac{dy(t)/y(t)}{dt/t},$$

evaluated at $t = 1$. Rearranging this expression, we have

$$e(x) = \left.\frac{dy(t)}{dt}\,\frac{t}{y}\right|_{t=1} = \left.\frac{df(tx)}{dt}\,\frac{t}{f(tx)}\right|_{t=1}.$$

Note that we must evaluate the expression at $t = 1$ to calculate the elasticity of scale at
the point $x$.

Thus, we have the following definition of local returns to scale:

(LOCAL) RETURNS TO SCALE. A production function $f(x)$ is said to exhibit
locally increasing, constant, or decreasing returns to scale as $e(x)$ is greater than,
equal to, or less than 1.
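
The elasticity of scale can be approximated by differentiating y(t) = f(tx) numerically at t = 1. The sketch below is illustrative only: it assumes a central finite difference with a hypothetical step size h, and a Cobb-Douglas function x1^0.3 x2^0.5 chosen for the example, for which the elasticity of scale equals the sum of the exponents.

```python
def elasticity_of_scale(f, x, h=1e-6):
    """Approximate e(x) = (dy/dt) * (t / y) at t = 1, where y(t) = f(t*x)."""
    def y(t):
        return f([t * xi for xi in x])

    dy_dt = (y(1.0 + h) - y(1.0 - h)) / (2.0 * h)  # central difference at t = 1
    return dy_dt / y(1.0)


def f(x):
    """Assumed example technology with exponents summing to 0.8."""
    return x[0] ** 0.3 * x[1] ** 0.5


e = elasticity_of_scale(f, [2.0, 3.0])
print(e, "locally decreasing" if e < 1 else "locally constant or increasing")
```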

3.2.5 The Marginal Rate of Technical Substitution 

Suppose that technology is summarized by a smooth production function and that we are
producing at a particular point $y^* = f(x_1^*, x_2^*)$. Suppose that we want to increase input 1 by a small
amount and decrease input 2 by some amount so as to maintain a constant
level of output. How can we determine this marginal rate of technical substitution
(MRTS) between these two factors? The way is the same as for deriving the marginal
rate of substitution of an indifference curve. Differentiating the production function while
holding output constant, we have

$$0 = \frac{\partial f}{\partial x_1}\,dx_1 + \frac{\partial f}{\partial x_2}\,dx_2,$$

which can be solved for

$$\frac{dx_2}{dx_1} = -\frac{\partial f/\partial x_1}{\partial f/\partial x_2} \equiv -\frac{MP_{x_1}}{MP_{x_2}}.$$

This gives us an explicit expression for the marginal rate of technical substitution, which is
the negative of the ratio of the marginal product of $x_1$ to the marginal product of $x_2$.

Example 3.2.1 (MRTS for a Cobb-Douglas Technology) Given that $f(x_1, x_2) = x_1^{a} x_2^{1-a}$,
we can take the derivatives to find

$$\frac{\partial f(x)}{\partial x_1} = a x_1^{a-1} x_2^{1-a} = a\left[\frac{x_2}{x_1}\right]^{1-a},$$

$$\frac{\partial f(x)}{\partial x_2} = (1 - a) x_1^{a} x_2^{-a} = (1 - a)\left[\frac{x_1}{x_2}\right]^{a}.$$

It follows that

$$\frac{dx_2(x_1)}{dx_1} = -\frac{\partial f/\partial x_1}{\partial f/\partial x_2} = -\frac{a}{1-a}\,\frac{x_2}{x_1}.$$
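
The closed-form MRTS for the Cobb-Douglas technology can be cross-checked against a numerical estimate of the ratio of marginal products. This is a minimal sketch, assuming a = 0.3 and a central finite difference with step h (both hypothetical choices made for illustration).

```python
A = 0.3  # assumed Cobb-Douglas exponent for the illustration


def f(x1, x2):
    """Cobb-Douglas technology x1^a * x2^(1-a)."""
    return x1 ** A * x2 ** (1 - A)


def mrts(fn, x1, x2, h=1e-6):
    """MRTS = -MP1 / MP2, with marginal products estimated by central differences."""
    mp1 = (fn(x1 + h, x2) - fn(x1 - h, x2)) / (2 * h)
    mp2 = (fn(x1, x2 + h) - fn(x1, x2 - h)) / (2 * h)
    return -mp1 / mp2


def mrts_closed_form(x1, x2):
    """Closed form for this technology: -(a / (1 - a)) * (x2 / x1)."""
    return -(A / (1 - A)) * (x2 / x1)


print(mrts(f, 2.0, 5.0), mrts_closed_form(2.0, 5.0))
```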

3.2.6 The Elasticity of Substitution 

The marginal rate of technical substitution measures the slope of an isoquant. The
elasticity of substitution measures the curvature of an isoquant. More specifically, the
elasticity of substitution measures the percentage change in the factor ratio divided by the
percentage change in the MRTS, with output being held fixed. If we let $\Delta(x_2/x_1)$ be the
change in the factor ratio and $\Delta MRTS$ be the change in the technical rate of substitution,
we can express this as

$$\sigma = \frac{\Delta(x_2/x_1)/(x_2/x_1)}{\Delta MRTS/MRTS}.$$

This is a relatively natural measure of curvature: it asks how the ratio of factor inputs
changes as the slope of the isoquant changes. If a small change in slope gives us a large
change in the factor input ratio, the isoquant is relatively flat, which means that the
elasticity of substitution is large.

In practice we think of the percent change as being very small and take the limit of
this expression as $\Delta$ goes to zero. Hence, the expression for $\sigma$ becomes

$$\sigma = \frac{MRTS}{(x_2/x_1)}\,\frac{d(x_2/x_1)}{dMRTS} = \frac{d\ln(x_2/x_1)}{d\ln|MRTS|}.$$

(The absolute value sign in the denominator is to convert the MRTS to a positive number
so that the logarithm makes sense.)



Example 3.2.2 (The Cobb-Douglas Production Function) We have seen above that

$$MRTS = -\frac{a}{1-a}\,\frac{x_2}{x_1},$$

or

$$\frac{x_2}{x_1} = -\frac{1-a}{a}\,MRTS.$$

It follows that

$$\ln\frac{x_2}{x_1} = \ln\frac{1-a}{a} + \ln|MRTS|.$$

This in turn implies

$$\sigma = \frac{d\ln(x_2/x_1)}{d\ln|MRTS|} = 1.$$



Example 3.2.3 (The CES Production Function) The constant elasticity of substitution
(CES) production function has the form

$$y = [a_1 x_1^{\rho} + a_2 x_2^{\rho}]^{\frac{1}{\rho}}.$$

It is easy to verify that the CES function exhibits constant returns to scale. It will probably
not surprise you to discover that the CES production function has a constant elasticity
of substitution. To verify this, note that the marginal rate of technical substitution is
given by

$$MRTS = -\frac{a_1}{a_2}\left(\frac{x_1}{x_2}\right)^{\rho - 1},$$

so that

$$\frac{x_2}{x_1} = \left[\frac{a_2}{a_1}\,|MRTS|\right]^{\frac{1}{1-\rho}}.$$

Taking logs, we see that

$$\ln\frac{x_2}{x_1} = \frac{1}{1-\rho}\ln\frac{a_2}{a_1} + \frac{1}{1-\rho}\ln|MRTS|.$$

Applying the definition of $\sigma$ using the logarithmic derivative,

$$\sigma = \frac{d\ln(x_2/x_1)}{d\ln|MRTS|} = \frac{1}{1-\rho}.$$
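
The constant elasticity of substitution can be verified numerically from the logarithmic definition. The sketch below is an illustration under stated assumptions: the weights a1 = 0.4 and a2 = 0.6, the step sizes, and the function names are hypothetical. It also relies on the fact that the CES function is homothetic, so the MRTS depends only on the ratio x2/x1 and it is enough to vary that ratio with x1 fixed at 1 instead of moving along a single isoquant.

```python
import math


def ces(x1, x2, rho, a1=0.4, a2=0.6):
    """CES production function [a1*x1^rho + a2*x2^rho]^(1/rho), rho != 0."""
    return (a1 * x1 ** rho + a2 * x2 ** rho) ** (1.0 / rho)


def abs_mrts(x1, x2, rho, h=1e-5):
    """|MRTS| = MP1 / MP2, with marginal products from central differences."""
    mp1 = (ces(x1 + h, x2, rho) - ces(x1 - h, x2, rho)) / (2 * h)
    mp2 = (ces(x1, x2 + h, rho) - ces(x1, x2 - h, rho)) / (2 * h)
    return mp1 / mp2


def elasticity_of_substitution(rho, ratio=2.0, step=1e-4):
    """sigma = d ln(x2/x1) / d ln|MRTS|, estimated around the given factor ratio."""
    lo, hi = ratio * (1 - step), ratio * (1 + step)
    d_ln_ratio = math.log(hi) - math.log(lo)
    d_ln_mrts = math.log(abs_mrts(1.0, hi, rho)) - math.log(abs_mrts(1.0, lo, rho))
    return d_ln_ratio / d_ln_mrts


for rho in (0.5, -1.0):
    print(rho, elasticity_of_substitution(rho), 1.0 / (1.0 - rho))
```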

3.3 Profit Maximization 
3.3.1 Producer Behavior 

A basic hypothesis on individual firm behavior in the producer theory is that a firm will
always choose a most profitable production plan from the production set. We will derive
input demand and output supply functions by considering a model of profit-maximizing
behavior coupled with a description of underlying production constraints.

Economic profit is defined to be the difference between the revenue a firm receives
and the costs that it incurs. It is important to understand that all (explicit and implicit)
costs must be included in the calculation of profit. Both revenues and costs of a firm
depend on the actions taken by the firm. We can write revenue as a function of the level
of operations of some $n$ actions, $R(a_1, \ldots, a_n)$, and costs as a function of these same $n$
activity levels, $C(a_1, \ldots, a_n)$, where actions can be in terms of the employment level of inputs,
the output level of production, or the prices of outputs if the firm has market power to set
prices.

A basic assumption of most economic analysis of firm behavior is that a firm acts so
as to maximize its profits; that is, a firm chooses actions $(a_1, \ldots, a_n)$ so as to maximize
$R(a_1, \ldots, a_n) - C(a_1, \ldots, a_n)$. The profit maximization problem facing the firm can then be
written as

$$\max_{a_1, \ldots, a_n} \; R(a_1, \ldots, a_n) - C(a_1, \ldots, a_n).$$

The first-order conditions for interior optimal actions, $a^* = (a_1^*, \ldots, a_n^*)$, are

$$\frac{\partial R(a^*)}{\partial a_i} = \frac{\partial C(a^*)}{\partial a_i}, \quad i = 1, \ldots, n.$$

The intuition behind these conditions should be clear: if marginal revenue were greater 
than marginal cost, it would pay to increase the level of the activity; if marginal revenue 
were less than marginal cost, it would pay to decrease the level of the activity. In general, 
revenue is composed of two parts: how much a firm sells of various outputs times the 
price of each output. Costs are also composed of two parts: how much a firm uses of each 
input times the price of each input. 

The firm's profit maximization problem therefore reduces to the problem of determin- 
ing what prices it wishes to charge for its outputs or pay for its inputs, and what levels 
of outputs and inputs it wishes to use. In determining its optimal policy, the firm faces 
two kinds of constraints: technological constraints that are specified by production sets 
and market constraints that concern the effect of actions of other agents on the firm. The 
firms described in the remainder of this chapter are assumed to exhibit the simplest kind 
of market behavior, namely that of price-taking behavior. Each firm will be assumed to 
take prices as given. Thus, the firm will be concerned only with determining the profit- 
maximizing levels of outputs and inputs. Such a price-taking firm is often referred to as
a competitive firm. We will consider the general case in Chapter 6 - the theory of
markets.

3.3.2 Producer's Optimal Choice 

Let $p$ be a vector of prices for inputs and outputs of the firm. The profit maximization
problem of the firm can be stated as

$$\pi(p) = \max_{y} \; py \quad \text{such that } y \text{ is in } Y. \qquad (3.1)$$

Note that since outputs are measured as positive numbers and inputs are measured as
negative numbers, the objective function for this problem is profits: revenues minus costs.
The function $\pi(p)$, which gives us the maximum profits as a function of the prices, is called
the profit function of the firm.

There are several useful variants of the profit function: 

Case 1. Short-run maximization problem. In this case, we might define the short-run
profit function, also known as the restricted profit function:

$$\pi(p, z) = \max_{y} \; py \quad \text{such that } y \text{ is in } Y(z).$$

Case 2. If the firm produces only one output, the profit function can be written as

$$\pi(p, w) = \max_{x} \; p f(x) - wx,$$

where $p$ is now the (scalar) price of output, $w$ is the vector of factor prices, and the inputs
are measured by the (nonnegative) vector $x = (x_1, \ldots, x_n)$.

The value of $y$ that solves the profit problem (3.1) is in general not unique. When there
is a unique profit-maximizing production plan, it is called the net output function
or net supply function, the corresponding input part is called the producer's input
demand function, and the corresponding output vector is called the producer's output
supply function. We will see from the following proposition that strict convexity of
the production set ensures the uniqueness of the optimal production plan.

Proposition 3.3.1 Suppose $Y$ is strictly convex. Then, for each given price vector $p \geq 0$, the profit
maximizing production plan is unique provided it exists.

Proof: Suppose not. Let $y$ and $y'$ be two distinct profit maximizing production plans for
$p \geq 0$. Then, we must have $py = py'$. Thus, by the strict convexity of $Y$, we have
$ty + (1 - t)y' \in \operatorname{int} Y$ for all $0 < t < 1$. Therefore, there exists some $k > 1$ such that

$$k[ty + (1 - t)y'] \in \operatorname{int} Y. \qquad (3.2)$$

Then $k[tpy + (1 - t)py'] = kpy > py$, which contradicts the fact that $y$ is a profit maximizing
production plan.

3.3.3 Producer's First-Order Conditions 

Profit-maximizing behavior can be characterized by calculus when the technology can be
described by a differentiable production function. For example, the first-order conditions
for the single output profit maximization problem with interior solution are

$$p\,\frac{\partial f(x^*)}{\partial x_i} = w_i, \quad i = 1, \ldots, n. \qquad (3.3)$$

Using vector notation, we can also write these conditions as

$$p\,Df(x^*) = w.$$

The first-order conditions state that the "value of the marginal product of each factor
must be equal to its price," i.e., marginal revenue equals marginal cost at the
profit-maximizing production plan. This first-order condition can also be exhibited graphically.
Consider the production possibilities set depicted in Figure 3.3. In this two-dimensional
case, profits are given by $\Pi = py - wx$. The level sets of this function for fixed $p$ and $w$ are
straight lines which can be represented as functions of the form: $y = \Pi/p + (w/p)x$. Here
the slope of the isoprofit line gives the wage measured in units of output, and the vertical
intercept gives us profits measured in units of output. At the point of maximal profits the
production function must lie below its tangent line at $x^*$; i.e., it must be "locally concave."

Figure 3.3: Profit maximization when the slope of the isoprofit line equals the slope of
the production function. (Axes: INPUT, OUTPUT; isoprofit line $\Pi = py - wx$.)

Similar to the arguments in the consumer theory, the calculus conditions derived above
make sense only when the choice variables can be varied in an open neighborhood of the
optimal choice. The relevant first-order conditions that also include boundary solutions
are given by the Kuhn-Tucker conditions:

$$p\,\frac{\partial f(x)}{\partial x_i} - w_i \leq 0 \quad \text{with equality if } x_i > 0, \quad i = 1, \ldots, n. \qquad (3.4)$$

Remark 3.3.1 There may exist no profit maximizing production plan when a production
technology exhibits constant returns to scale or increasing returns to scale. For
example, consider the case where the production function is $f(x) = x$. Then for $p > w$ no
profit-maximizing plan will exist. It is clear from this example that the only nontrivial
profit-maximizing position for a constant-returns-to-scale firm is the case of $p = w$ and
zero profits. In this case, all production plans are profit-maximizing production plans. If
$(y, x)$ yields maximal profits of zero for some constant returns technology, then $(ty, tx)$
will also yield zero profits and will therefore also be profit-maximizing.

3.3.4 Sufficiency of Producer's First-Order Condition

The second-order condition for profit maximization is that the matrix of second derivatives
of the production function must be negative semi-definite at the optimal point; that
is, the second-order condition requires that the Hessian matrix

$$D^2 f(x^*) = \left(\frac{\partial^2 f(x^*)}{\partial x_i \partial x_j}\right)$$

must satisfy the condition that $h D^2 f(x^*) h' \leq 0$ for all vectors $h$. (The prime indicates
the transpose operation.) Geometrically, the requirement that the Hessian matrix is
negative semi-definite means that the production function must be locally concave in the
neighborhood of an optimal choice. Formally, we have the following proposition.

Proposition 3.3.2 Suppose that $f(x)$ is differentiable and concave on $\Re^n_+$ and $(p, w) > 0$.
If $x > 0$ satisfies the first-order conditions given in (3.4), then $x$ is a (globally) profit
maximizing production plan at prices $(p, w)$.

Remark 3.3.2 The strict concavity of $f(x)$ can be checked by verifying that the leading
principal minors of the Hessian alternate in sign, i.e.,

$$f_{11} < 0, \qquad \begin{vmatrix} f_{11} & f_{12} \\ f_{21} & f_{22} \end{vmatrix} > 0, \qquad \begin{vmatrix} f_{11} & f_{12} & f_{13} \\ f_{21} & f_{22} & f_{23} \\ f_{31} & f_{32} & f_{33} \end{vmatrix} < 0,$$

and so on, where $f_{ij} = \frac{\partial^2 f}{\partial x_i \partial x_j}$. This algebraic condition is useful for checking second-order
conditions.

Example 3.3.1 (The Profit Function for Cobb-Douglas Technology) Consider the
problem of maximizing profits for the production function of the form $f(x) = x^a$ where
$a > 0$. The first-order condition is

$$p a x^{a-1} = w,$$

and the second-order condition reduces to

$$p a (a - 1) x^{a-2} \leq 0.$$

The second-order condition can only be satisfied when $a \leq 1$, which means that the
production function must have constant or decreasing returns to scale for competitive
profit maximization to be meaningful.

If $a = 1$, the first-order condition reduces to $p = w$. Hence, when $w = p$ any value of
$x$ is a profit-maximizing choice. When $a < 1$, we use the first-order condition to solve for
the factor demand function

$$x(p, w) = \left(\frac{w}{ap}\right)^{\frac{1}{a-1}}.$$

The supply function is given by

$$y(p, w) = f(x(p, w)) = \left(\frac{w}{ap}\right)^{\frac{a}{a-1}},$$

and the profit function is given by

$$\pi(p, w) = p\,y(p, w) - w\,x(p, w) = w\left(\frac{1-a}{a}\right)\left(\frac{w}{ap}\right)^{\frac{1}{a-1}}.$$
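
The closed forms in this example can be checked directly. The sketch below is illustrative: the parameter values p = 3, w = 1, a = 0.5 and the function names are hypothetical choices. It evaluates the factor demand, supply, and profit, and compares the profit computed as p y - w x with the closed-form profit function.

```python
def factor_demand(p, w, a):
    """x(p, w) = (w / (a p))^(1 / (a - 1)); intended for 0 < a < 1."""
    return (w / (a * p)) ** (1.0 / (a - 1.0))


def supply(p, w, a):
    """y(p, w) = f(x(p, w)) with f(x) = x^a."""
    return factor_demand(p, w, a) ** a


def profit(p, w, a):
    """Profit evaluated at the optimal input choice: p*y - w*x."""
    x = factor_demand(p, w, a)
    return p * x ** a - w * x


def profit_closed_form(p, w, a):
    """pi(p, w) = w * ((1 - a) / a) * (w / (a p))^(1 / (a - 1))."""
    return w * (1.0 - a) / a * (w / (a * p)) ** (1.0 / (a - 1.0))


p, w, a = 3.0, 1.0, 0.5
x_star = factor_demand(p, w, a)
print(x_star, supply(p, w, a), profit(p, w, a), profit_closed_form(p, w, a))
```

At the computed input level the first-order condition p a x^(a-1) = w can be confirmed by substitution, and no input level on a coarse grid should yield a higher profit.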



3.3.5 Properties of Net Supply Functions 

In this section, we show that, because the net supply functions are the solutions to the profit
maximization problem, certain restrictions are imposed on the behavior of
the demand and supply functions.

Proposition 3.3.3 Net output functions $y(p)$ are homogeneous of degree zero, i.e., $y(tp) = y(p)$
for all $t > 0$.

Proof. It is easy to see that if we multiply all of the prices by some positive number $t$, the
production plan that maximizes profits will not change. Hence, we must have $y(tp) = y(p)$
for all $t > 0$.

Proposition 3.3.4 (Negative Definiteness of Substitution Matrix) Let $y = f(x)$
be a twice differentiable and strictly concave single output production function, and let
$x(p, w)$ be the input demand function. Then, the substitution matrix

$$Dx(p, w) = \left[\frac{\partial x_i(p, w)}{\partial w_j}\right]$$

is symmetric negative definite.

Proof. Without loss of generality, we normalize $p = 1$. Then the first-order conditions
for profit maximization are

$$Df(x(w)) - w = 0.$$

If we differentiate with respect to $w$, we get

$$D^2 f(x(w))\,Dx(w) - I = 0.$$

Solving this equation for the substitution matrix, we find

$$Dx(w) = [D^2 f(x(w))]^{-1}.$$

Recall that the second-order condition for (strict) profit maximization is that the
Hessian matrix is a symmetric negative definite matrix. It is a standard result of linear
algebra that the inverse of a symmetric negative definite matrix is a symmetric negative
definite matrix. Hence $D^2 f(x(w))$ is a symmetric negative definite matrix, and thus the
substitution matrix $Dx(w)$ is itself a symmetric negative definite matrix.

Remark 3.3.3 Note that since $Dx(p, w)$ is symmetric and negative definite, we have in
particular:

(1) $\partial x_i/\partial w_i < 0$ for $i = 1, 2, \ldots, n$, since the diagonal entries of a negative
definite matrix must be negative.

(2) $\partial x_i/\partial w_j = \partial x_j/\partial w_i$ by the symmetry of the matrix.

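
These properties can be illustrated with a two-input example. The sketch below assumes a Cobb-Douglas technology f(x) = x1^a x2^b with a + b < 1 (a hypothetical choice, with a = 0.3 and b = 0.4), for which the factor demands have the closed form used in `factor_demands`; that closed form is derived from the first-order conditions and is not taken from the text. The substitution matrix is estimated by central differences.

```python
def factor_demands(p, w1, w2, a=0.3, b=0.4):
    """Closed-form input demands for f(x) = x1^a * x2^b, assuming a + b < 1."""
    d = 1.0 - a - b
    x1 = (p * (a / w1) ** (1 - b) * (b / w2) ** b) ** (1.0 / d)
    x2 = (p * (a / w1) ** a * (b / w2) ** (1 - a)) ** (1.0 / d)
    return x1, x2


def substitution_matrix(p, w1, w2, h=1e-6):
    """Central-difference estimate of the matrix [dx_i / dw_j]."""
    up1, dn1 = factor_demands(p, w1 + h, w2), factor_demands(p, w1 - h, w2)
    up2, dn2 = factor_demands(p, w1, w2 + h), factor_demands(p, w1, w2 - h)
    col1 = [(up1[i] - dn1[i]) / (2 * h) for i in range(2)]  # derivatives w.r.t. w1
    col2 = [(up2[i] - dn2[i]) / (2 * h) for i in range(2)]  # derivatives w.r.t. w2
    return [[col1[0], col2[0]], [col1[1], col2[1]]]


S = substitution_matrix(2.0, 1.0, 1.5)
print(S)
print(factor_demands(2.0, 1.0, 1.5), factor_demands(6.0, 3.0, 4.5))
```

The last line scales all prices by 3, which should leave the demands unchanged (homogeneity of degree zero).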
3.3.6 Weak Axiom of Profit Maximization 

In this subsection we examine the consequences of profit-maximizing behavior. Suppose
that we are given a list of observed price vectors $p^t$, and the associated net output
vectors $y^t$, for $t = 1, \ldots, T$. We refer to this collection as the data. In terms of the net
supply functions we described before, the data are just $(p^t, y(p^t))$ for some observations
$t = 1, \ldots, T$. If the firm is maximizing profits, then the observed net output choice at price
$p^t$ must have a level of profit at least as great as the profit at any other net output the
firm could have chosen. Thus, a necessary condition for profit maximization is

$$p^t y^t \geq p^t y^s \quad \text{for all } t \text{ and } s = 1, \ldots, T.$$

We will refer to this condition as the Weak Axiom of Profit Maximization (WAPM).

In Figure 3.4A we have drawn two observations that violate WAPM, while Figure 3.4B
depicts two observations that satisfy WAPM.

Figure 3.4: WAPM. Panel A shows two observations that violate WAPM since $p^1 y^2 > p^1 y^1$.
Panel B shows two observations that satisfy WAPM.

WAPM is a simple, but very useful, condition; let us derive some of its consequences.
Fix two observations $t$ and $s$, and write WAPM for each one. We have

$$p^t (y^t - y^s) \geq 0$$
$$-p^s (y^t - y^s) \geq 0.$$

Adding these two inequalities gives us

$$(p^t - p^s)(y^t - y^s) \geq 0.$$

Letting $\Delta p = (p^t - p^s)$ and $\Delta y = (y^t - y^s)$, we can rewrite this expression as

$$\Delta p \, \Delta y \geq 0.$$

In other words, the inner product of a vector of price changes with the associated vector
of changes in net outputs must be nonnegative.
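
WAPM is straightforward to test on a finite data set. The following is a minimal sketch, assuming observations are stored as tuples of prices and net outputs (outputs positive, inputs negative); the sample data are hypothetical and were generated from the technology f(x) = sqrt(x) purely for illustration.

```python
def dot(u, v):
    """Inner product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def satisfies_wapm(prices, plans, tol=1e-12):
    """Check p^t y^t >= p^t y^s for every pair of observations t and s."""
    return all(
        dot(p_t, y_t) >= dot(p_t, y_s) - tol
        for p_t, y_t in zip(prices, plans)
        for y_s in plans
    )


# Hypothetical data: prices are (output price, input price),
# plans are (output, -input), generated from f(x) = sqrt(x).
prices = [(2.0, 1.0), (4.0, 1.0)]
plans = [(1.0, -1.0), (2.0, -4.0)]

print(satisfies_wapm(prices, plans))        # data consistent with profit maximization
print(satisfies_wapm(prices, plans[::-1]))  # same plans paired with the wrong prices
```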



3.3.7 Recoverability 

Does WAPM exhaust all of the implications of profit-maximizing behavior, or are there
other useful conditions implied by profit maximization? One way to answer this question
is to try to construct a technology that generates the observed behavior $(p^t, y^t)$ as
profit-maximizing behavior. If we can find such a technology for any set of data that satisfy
WAPM, then WAPM must indeed exhaust the implications of profit-maximizing behavior.
We refer to the operation of constructing a technology consistent with the observed choices
as the operation of recoverability.

We will show that if a set of data satisfies WAPM it is always possible to find a
technology for which the observed choices are profit-maximizing choices. In fact, it is
always possible to find a production set $Y$ that is closed and convex. The remainder of
this subsection will sketch the proof of this assertion. Formally, we have

Proposition 3.3.5 For any set of data that satisfies WAPM, there is a convex and closed
production set such that the observed choices are profit-maximizing choices.

Proof. We want to construct a convex and closed production set that will generate
the observed choices $(p^t, y^t)$ as profit-maximizing choices. We can actually construct two
such production sets, one that serves as an "inner bound" to the true technology and
one that serves as an "outer bound." We start with the inner bound.

Suppose that the true production set $Y$ is convex and monotonic. Since $Y$ must
contain $y^t$ for $t = 1, \ldots, T$, it is natural to take the inner bound to be the smallest convex,
monotonic set that contains $y^1, \ldots, y^T$. This set is called the convex, monotonic hull of
the points $y^1, \ldots, y^T$ and is denoted by

$$YI = \text{convex, monotonic hull of } \{y^t : t = 1, \ldots, T\}.$$

The set $YI$ is depicted in Figure 3.5.

Figure 3.5: The set of YI and YO. (Axes: INPUT, OUTPUT.)

It is easy to show that for the technology $YI$, $y^t$ is a profit-maximizing choice at prices
$p^t$. All we have to do is to check that for all $t$,

$$p^t y^t \geq p^t y \quad \text{for all } y \text{ in } YI.$$

Suppose that this is not the case. Then for some observation $t$, $p^t y^t < p^t y$ for some $y$
in $YI$. But inspecting the diagram shows that there must then exist some observation $s$
such that $p^t y^t < p^t y^s$. But this inequality violates WAPM.

Thus the set YI rationalizes the observed behavior in the sense that it is one possible
technology that could have generated that behavior. It is not hard to see that YI must be
contained in any convex technology that generated the observed behavior: if Y generated
the observed behavior and it is convex, then it must contain the observed choices $y^t$ and
the convex hull of these points is the smallest such set. In this sense, YI gives us an
"inner bound" to the true technology that generated the observed choices.

It is natural to ask if we can find an outer bound to this "true" technology. That is, 
can we find a set YO that is guaranteed to contain any technology that is consistent with 
the observed behavior? 

The trick to answering this question is to rule out all of the points that couldn't possibly
be in the true technology and then take everything that is left over. More precisely, let
us define NOTY by

$$NOTY = \{y : p^t y > p^t y^t \text{ for some } t\}.$$

NOTY consists of all those net output bundles that yield higher profits than some
observed choice. If the firm is a profit maximizer, such bundles couldn't be technologically
feasible; otherwise they would have been chosen. Now as our outer bound to $Y$ we just
take the complement of this set:

$$YO = \{y : p^t y \leq p^t y^t \text{ for all } t = 1, \ldots, T\}.$$

The set YO is depicted in Figure 3.5B.

In order to show that YO rationalizes the observed behavior we must show that the
profits at the observed choices are at least as great as the profits at any other $y$ in YO.
Suppose not. Then there is some $y^t$ such that $p^t y^t < p^t y$ for some $y$ in YO. But this
contradicts the definition of YO given above. It is clear from the construction of YO that
it must contain any production set consistent with the data $(p^t, y^t)$. Hence, YO and YI
form the tightest inner and outer bounds to the true production set that generated the
data.
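
Membership in the outer bound YO reduces to a finite list of linear inequalities, so it is easy to test. The sketch below is an illustration only: the data are the same hypothetical two observations generated from f(x) = sqrt(x), and `in_outer_bound` is a name introduced here, not one from the text.

```python
def dot(u, v):
    """Inner product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def in_outer_bound(y, prices, plans, tol=1e-12):
    """True if p^t y <= p^t y^t for every observation t, i.e. y is in YO."""
    return all(dot(p_t, y) <= dot(p_t, y_t) + tol for p_t, y_t in zip(prices, plans))


# Hypothetical data: prices (output price, input price), plans (output, -input).
prices = [(2.0, 1.0), (4.0, 1.0)]
plans = [(1.0, -1.0), (2.0, -4.0)]

print(in_outer_bound((1.5, -2.25), prices, plans))  # a plan on the assumed true frontier
print(in_outer_bound((2.0, -1.0), prices, plans))   # a plan that would beat observation 1
```

Every observed plan lies in YO whenever the data satisfy WAPM, since the defining inequalities for YO at the observed plans are exactly the WAPM inequalities.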



3.4 Profit Function 

Given any production set $Y$, we have seen how to calculate the profit function, $\pi(p)$,
which gives us the maximum profit attainable at prices $p$. The profit function possesses
several important properties that follow directly from its definition. These properties are
very useful for analyzing profit-maximizing behavior.

3.4.1 Properties of the Profit Function 

The properties given below follow solely from the assumption of profit maximization. No 
assumptions about convexity, monotonicity, or other sorts of regularity are necessary. 

Proposition 3.4.1 (Properties of the Profit Function) The profit function $\pi(p)$ has the
following properties:

(1) Nondecreasing in output prices, nonincreasing in input prices. If $p'_i \geq p_i$ for all
outputs and $p'_j \leq p_j$ for all inputs, then $\pi(p') \geq \pi(p)$.

(2) Homogeneous of degree 1 in $p$. $\pi(tp) = t\pi(p)$ for all $t \geq 0$.

(3) Convex in $p$. Let $p'' = tp + (1 - t)p'$ for $0 \leq t \leq 1$. Then $\pi(p'') \leq t\pi(p) + (1 - t)\pi(p')$.

(4) Continuous in $p$. The function $\pi(p)$ is continuous, at least when $\pi(p)$ is well-defined
and $p_i > 0$ for $i = 1, \ldots, n$.

Proof.

1. Let $y$ be a profit-maximizing net output vector at $p$, so that $\pi(p) = py$, and let
$y'$ be a profit-maximizing net output vector at $p'$, so that $\pi(p') = p'y'$. Then by
definition of profit maximization we have $p'y' \geq p'y$. Since $p'_i \geq p_i$ for all $i$ for which
$y_i \geq 0$ and $p'_i \leq p_i$ for all $i$ for which $y_i \leq 0$, we also have $p'y \geq py$. Putting these
two inequalities together, we have $\pi(p') = p'y' \geq py = \pi(p)$, as required.

2. Let $y$ be a profit-maximizing net output vector at $p$, so that $py \geq py'$ for all $y'$ in
$Y$. It follows that for $t \geq 0$, $tpy \geq tpy'$ for all $y'$ in $Y$. Hence $y$ also maximizes
profits at prices $tp$. Thus $\pi(tp) = tpy = t\pi(p)$.

3. Let $y$ maximize profits at $p$, $y'$ maximize profits at $p'$, and $y''$ maximize profits at
$p''$. Then we have

$$\pi(p'') = p''y'' = (tp + (1 - t)p')y'' = tpy'' + (1 - t)p'y''.$$

By the definition of profit maximization, we know that

$$tpy'' \leq tpy = t\pi(p)$$
$$(1 - t)p'y'' \leq (1 - t)p'y' = (1 - t)\pi(p').$$

Adding these two inequalities and using the expression for $\pi(p'')$ above, we have

$$\pi(p'') \leq t\pi(p) + (1 - t)\pi(p'),$$

as required.

4. The continuity of $\pi(p)$ follows from the Theorem of the Maximum.
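
Since these properties do not depend on convexity or other regularity of Y, they can be illustrated with an arbitrary finite production set. The sketch below assumes a hypothetical four-point set Y of net output vectors (output, -input) and computes the profit function by brute force; it is a numerical illustration of properties (1) to (3), not a proof.

```python
def dot(u, v):
    """Inner product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def profit(p, Y):
    """pi(p) = max of p*y over a finite production set Y."""
    return max(dot(p, y) for y in Y)


# Hypothetical finite production set: (output, -input) pairs.
Y = [(0.0, 0.0), (1.0, -1.0), (2.0, -4.0), (3.0, -9.0)]

p = (2.0, 1.0)        # (output price, input price)
p_prime = (6.0, 1.0)  # higher output price, same input price
t = 0.5
p_mix = tuple(t * a + (1 - t) * b for a, b in zip(p, p_prime))

print(profit(p, Y), profit(p_prime, Y), profit(p_mix, Y))
print(profit(tuple(3 * a for a in p), Y), 3 * profit(p, Y))
```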

3.4.2 Deriving Net Supply Functions from Profit Function 

If we are given the net supply function y(p), it is easy to calculate the profit function. We 
just substitute into the definition of profits to find $\pi(p) = p\,y(p)$. Suppose that instead
we are given the profit function and are asked to find the net supply functions. How can 
that be done? It turns out that there is a very simple way to solve this problem: just 
differentiate the profit function. The proof that this works is the content of the next 
proposition. 

Proposition 3.4.2 (Hotelling's lemma.) Let $y_i(p)$ be the firm's net supply function
for good $i$. Then

$$y_i(p) = \frac{\partial \pi(p)}{\partial p_i} \quad \text{for } i = 1, \ldots, n,$$

assuming that the derivative exists and that $p_i > 0$.

Proof. Suppose $y^*$ is a profit-maximizing net output vector at prices $p^*$. Then
define the function

$$g(p) = \pi(p) - p y^*.$$

Clearly, the profit-maximizing production plan at prices $p$ will always be at least as
profitable as the production plan $y^*$. However, the plan $y^*$ will be a profit-maximizing
plan at prices $p^*$, so the function $g$ reaches a minimum value of 0 at $p^*$. The assumptions
on prices imply this is an interior minimum.

The first-order conditions for a minimum then imply that

$$\frac{\partial g(p^*)}{\partial p_i} = \frac{\partial \pi(p^*)}{\partial p_i} - y_i^* = 0 \quad \text{for } i = 1, \ldots, n.$$

Since this is true for all choices of $p^*$, the proof is done.

Remark 3.4.1 Again, we can prove this derivative property of the profit function by
applying the Envelope Theorem:

$$\frac{\partial \pi(p)}{\partial p_i} = \left.\frac{\partial (p y)}{\partial p_i}\right|_{y = y(p)} = y_i(p).$$

This expression says that the derivative of $\pi$ with respect to $p_i$ is given by the partial
derivative of the objective $py$ with respect to $p_i$, holding $y$ fixed at the optimal choice. This is the
meaning of the vertical bar to the right of the derivative.
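
Hotelling's lemma can be checked numerically for the single-input technology f(x) = x^a. The sketch below assumes a = 0.5 and uses the closed-form profit, supply, and factor demand functions for that technology; the finite-difference helper `partial` and its step size are hypothetical. The derivative of profit with respect to the output price should match supply, and the derivative with respect to the input price should match the negative of factor demand, because inputs are negative net outputs.

```python
A = 0.5  # assumed exponent of f(x) = x^a


def factor_demand(p, w):
    """x(p, w) = (w / (a p))^(1 / (a - 1))."""
    return (w / (A * p)) ** (1.0 / (A - 1.0))


def supply(p, w):
    """y(p, w) = (w / (a p))^(a / (a - 1))."""
    return (w / (A * p)) ** (A / (A - 1.0))


def profit(p, w):
    """pi(p, w) = w * ((1 - a) / a) * (w / (a p))^(1 / (a - 1))."""
    return w * (1.0 - A) / A * (w / (A * p)) ** (1.0 / (A - 1.0))


def partial(fn, p, w, wrt, h=1e-6):
    """Central-difference partial derivative of fn with respect to 'p' or 'w'."""
    if wrt == "p":
        return (fn(p + h, w) - fn(p - h, w)) / (2 * h)
    return (fn(p, w + h) - fn(p, w - h)) / (2 * h)


p, w = 3.0, 1.0
print(partial(profit, p, w, "p"), supply(p, w))
print(partial(profit, p, w, "w"), -factor_demand(p, w))
```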



3.5 Cost Minimization 

An important implication of the firm choosing a profit-maximizing production plan is
that there is no way to produce the same amounts of outputs at a lower total input cost.
Thus, cost minimization is a necessary condition for profit maximization. This observation
motivates an independent study of the firm's cost minimization. The problem is of
interest for several reasons. First, it leads us to a number of results and constructions
that are technically very useful. Second, as long as the firm is a price taker in its input
market, the results flowing from the cost minimization continue to be valid whether or
not the output market is competitive and so whether or not the firm takes the output
price as given. Third, when the production set exhibits nondecreasing returns to scale,
the cost function and optimizing vectors of the cost minimization problem, which keep
the levels of outputs fixed, are better behaved than the profit function.

To be concrete, we focus our analysis on the single-output case. We assume throughout
that firms are perfectly competitive on their input markets and therefore they face fixed
prices. Let $w = (w_1, w_2, \ldots, w_n) \geq 0$ be a vector of prevailing market prices at which the
firm can buy inputs $x = (x_1, x_2, \ldots, x_n)$.






3.5.1 First-Order Conditions of Cost Minimization 

Let us consider the problem of finding a cost-minimizing way to produce a given level of 
output: 

min wx 

X 

such that /(x) ^ y 

We analyze this constrained minimization problem using the Lagrangian function: 

$$\mathcal{L}(\lambda, x) = wx - \lambda (f(x) - y),$$

where the production function $f$ is assumed to be differentiable and $\lambda$ is the Lagrange
multiplier. The first-order conditions characterizing an interior solution $x^*$ are

$$w_i - \lambda \frac{\partial f(x^*)}{\partial x_i} = 0, \quad i = 1, \ldots, n \tag{3.5}$$

$$f(x^*) = y \tag{3.6}$$

or in vector notation, the condition can be written as

$$w = \lambda Df(x^*).$$

We can interpret these first-order conditions by dividing the $i$th condition by the $j$th
condition to get

$$\frac{w_i}{w_j} = \frac{\partial f(x^*)/\partial x_i}{\partial f(x^*)/\partial x_j}, \quad i, j = 1, \ldots, n, \tag{3.7}$$

which means the marginal rate of technical substitution of factor $i$ for factor $j$ equals the
economic rate of substitution of factor $i$ for factor $j$ at the cost-minimizing input bundle.

This first-order condition can also be represented graphically. In Figure 3.6, the curved 
lines represent iso-quants and the straight lines represent constant cost curves. When y is 
fixed, the problem of the firm is to find a cost-minimizing point on a given iso-quant. It 
is clear that such a point will be characterized by the tangency condition that the slope 
of the constant cost curve must be equal to the slope of the iso-quant. 
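
The tangency condition can be checked numerically. The following is a minimal sketch, assuming an illustrative Cobb-Douglas technology $f(x_1, x_2) = x_1^a x_2^b$ and a simple ternary search along the isoquant; the function names and parameter values are our own choices and are not part of the notes.

```python
# Cost minimization along an isoquant for an assumed Cobb-Douglas
# technology f(x1, x2) = x1**a * x2**b (illustrative example only).

def isoquant_x2(x1, y, a, b):
    """Amount of factor 2 needed to produce y when x1 units of factor 1 are used."""
    return (y / x1 ** a) ** (1.0 / b)

def minimize_cost(w1, w2, y, a, b, lo=1e-6, hi=1e6, iters=300):
    """Ternary search over x1; cost along the isoquant is convex in x1."""
    def cost(x1):
        return w1 * x1 + w2 * isoquant_x2(x1, y, a, b)
    for _ in range(iters):
        m1 = lo + (hi - lo) / 3.0
        m2 = hi - (hi - lo) / 3.0
        if cost(m1) < cost(m2):
            hi = m2
        else:
            lo = m1
    x1 = 0.5 * (lo + hi)
    return x1, isoquant_x2(x1, y, a, b)

def mrts(x1, x2, a, b):
    """Ratio of marginal products f_1 / f_2 for the Cobb-Douglas form."""
    return (a / x1) / (b / x2)

w1, w2, y, a, b = 2.0, 3.0, 10.0, 0.4, 0.6
x1, x2 = minimize_cost(w1, w2, y, a, b)
print("cost-minimizing bundle:", (x1, x2))
print("price ratio w1/w2:", w1 / w2, " MRTS f1/f2:", mrts(x1, x2, a, b))
```

At the computed bundle the two printed ratios should agree up to numerical error, which is the content of condition (3.7).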

Again, the conditions are valid only for interior operating positions: they must be 
modified if a cost minimization point occurs on the boundary. The appropriate conditions 
turn out to be 

$$\lambda \frac{\partial f(x^*)}{\partial x_i} - w_i \leq 0 \quad \text{with equality if } x_i > 0, \quad i = 1, 2, \ldots, n. \tag{3.8}$$

Figure 3.6: Cost minimization. At a point that minimizes costs, the isoquant must be
tangent to the constant cost line.

Remark 3.5.1 It is known that a continuous function achieves a minimum and a max- 
imum value on a closed and bounded set. The objective function wx is certainly a 
continuous function and the set V(y) is a closed set by hypothesis. All that we need to 
establish is that we can restrict our attention to a bounded subset of V(y). But this is 
easy. Just pick an arbitrary value of x in V(y), say x'. Clearly the minimal cost factor bundle
must have a cost no greater than wx'. Hence, we can restrict our attention to the subset {x in
V(y): $wx \leq wx'$}, which will certainly be a bounded subset, as long as w > 0. Thus the
cost minimizing input bundle always exists.

3.5.2 Sufficiency of First-Order Conditions for Cost Minimization

Again, as in the consumer's constrained optimization problem, the above first-order conditions
are merely necessary conditions for a local optimum. However, these necessary first-order
conditions are in fact sufficient for a global optimum when the production function is
quasi-concave, which is formally stated in the following proposition.

Proposition 3.5.1 Suppose that $f: \mathbb{R}^n_+ \to \mathbb{R}$ is differentiable and quasi-concave on
$\mathbb{R}^n_+$ and $w > 0$. If $(x, \lambda) > 0$ satisfies the first-order conditions given in (3.5) and (3.6),
then $x$ solves the firm's cost minimization problem at prices $w$.

Proof. Since $f: \mathbb{R}^n_+ \to \mathbb{R}$ is differentiable and quasi-concave, the input requirement set
$V(y) = \{x : f(x) \geq y\}$ is a convex and closed set. Further, the objective function $wx$ is
convex and continuous, so by the Kuhn-Tucker theorem the first-order conditions are
sufficient for the constrained minimization problem.

Similarly, the strict quasi-concavity of $f$ can be checked by verifying whether the naturally
ordered principal minors of the bordered Hessian alternate in sign, i.e.,

$$\begin{vmatrix} 0 & f_1 & f_2 \\ f_1 & f_{11} & f_{12} \\ f_2 & f_{21} & f_{22} \end{vmatrix} > 0,$$

$$\begin{vmatrix} 0 & f_1 & f_2 & f_3 \\ f_1 & f_{11} & f_{12} & f_{13} \\ f_2 & f_{21} & f_{22} & f_{23} \\ f_3 & f_{31} & f_{32} & f_{33} \end{vmatrix} < 0,$$

and so on, where $f_i = \frac{\partial f}{\partial x_i}$ and $f_{ij} = \frac{\partial^2 f}{\partial x_i \partial x_j}$.

For each choice of w and y there will be some choice of x* that minimizes the cost of
producing y units of output. We will call the function that gives us this optimal choice
the conditional input demand function and write it as x(w, y). Note that conditional
factor demands depend on the level of output produced as well as on the factor prices.
The cost function is the minimal cost at the factor prices w and output level y; that is,

$$c(w, y) = w x(w, y).$$

Example 3.5.1 (Cost Function for the Cobb-Douglas Technology) Consider the
cost minimization problem

$$c(w, y) = \min_{x_1, x_2} \; w_1 x_1 + w_2 x_2 \quad \text{such that } A x_1^a x_2^b = y.$$

Solving the constraint for $x_2$, we see that this problem is equivalent to

$$\min_{x_1} \; w_1 x_1 + w_2 A^{-\frac{1}{b}} y^{\frac{1}{b}} x_1^{-\frac{a}{b}}.$$

The first-order condition is

$$w_1 - \frac{a}{b} w_2 A^{-\frac{1}{b}} y^{\frac{1}{b}} x_1^{-\frac{a+b}{b}} = 0,$$

which gives us the conditional input demand function for factor 1:

$$x_1(w_1, w_2, y) = A^{-\frac{1}{a+b}} \left[\frac{a w_2}{b w_1}\right]^{\frac{b}{a+b}} y^{\frac{1}{a+b}}.$$

The other conditional input demand function is

$$x_2(w_1, w_2, y) = A^{-\frac{1}{a+b}} \left[\frac{a w_2}{b w_1}\right]^{-\frac{a}{a+b}} y^{\frac{1}{a+b}}.$$

The cost function is thus

$$c(w_1, w_2, y) = w_1 x_1(w_1, w_2, y) + w_2 x_2(w_1, w_2, y) = A^{-\frac{1}{a+b}} \left[\left(\frac{a}{b}\right)^{\frac{b}{a+b}} + \left(\frac{a}{b}\right)^{-\frac{a}{a+b}}\right] w_1^{\frac{a}{a+b}} w_2^{\frac{b}{a+b}} y^{\frac{1}{a+b}}.$$

When $A = 1$ and $a + b = 1$ (constant returns to scale), we particularly have

$$c(w_1, w_2, y) = K w_1^a w_2^{1-a} y,$$

where $K = a^{-a} (1-a)^{a-1}$.
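
The closed forms in Example 3.5.1 can be checked with a few lines of code. The sketch below is only illustrative: the function names and the numerical values of $A$, $a$, $b$, $w$ and $y$ are assumptions chosen for the check, not values from the notes.

```python
def cd_conditional_demands(w1, w2, y, A, a, b):
    """Conditional input demands for f(x1, x2) = A * x1**a * x2**b."""
    s = a + b
    scale = A ** (-1.0 / s) * y ** (1.0 / s)
    ratio = (a * w2) / (b * w1)
    return scale * ratio ** (b / s), scale * ratio ** (-a / s)

def cd_cost(w1, w2, y, A, a, b):
    """Cost function as the value of the conditional demands."""
    x1, x2 = cd_conditional_demands(w1, w2, y, A, a, b)
    return w1 * x1 + w2 * x2

def crs_cost(w1, w2, y, a):
    """Special case A = 1, a + b = 1: c = K * w1**a * w2**(1-a) * y."""
    K = a ** (-a) * (1.0 - a) ** (a - 1.0)
    return K * w1 ** a * w2 ** (1.0 - a) * y

w1, w2, y = 2.0, 5.0, 7.0
x1, x2 = cd_conditional_demands(w1, w2, y, A=1.5, a=0.3, b=0.5)
print("output produced by the demanded bundle:", 1.5 * x1 ** 0.3 * x2 ** 0.5)
print("general formula vs CRS formula:",
      cd_cost(w1, w2, y, 1.0, 0.3, 0.7), crs_cost(w1, w2, y, 0.3))
```

The first printed number should reproduce the target output $y$, and the two cost values should coincide when $A = 1$ and $a + b = 1$.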

Example 3.5.2 (The Cost function for the CES technology) Suppose that $f(x_1, x_2) = (x_1^\rho + x_2^\rho)^{\frac{1}{\rho}}$. What is the associated cost function? The cost minimization problem is

$$\min \; w_1 x_1 + w_2 x_2 \quad \text{such that } x_1^\rho + x_2^\rho = y^\rho.$$

The first-order conditions are

$$w_1 - \lambda \rho x_1^{\rho-1} = 0$$
$$w_2 - \lambda \rho x_2^{\rho-1} = 0$$
$$x_1^\rho + x_2^\rho = y^\rho.$$

Solving the first two equations for $x_1^\rho$ and $x_2^\rho$, we have

$$x_1^\rho = w_1^{\frac{\rho}{\rho-1}} (\lambda\rho)^{\frac{-\rho}{\rho-1}} \tag{3.9}$$
$$x_2^\rho = w_2^{\frac{\rho}{\rho-1}} (\lambda\rho)^{\frac{-\rho}{\rho-1}}. \tag{3.10}$$

Substitute this into the production function to find

$$(\lambda\rho)^{\frac{-\rho}{\rho-1}} \left(w_1^{\frac{\rho}{\rho-1}} + w_2^{\frac{\rho}{\rho-1}}\right) = y^\rho.$$

Solve this for $(\lambda\rho)^{\frac{-\rho}{\rho-1}}$ and substitute into equations (3.9) and (3.10). This gives us the
conditional input demand functions

$$x_1(w_1, w_2, y) = w_1^{\frac{1}{\rho-1}} \left[w_1^{\frac{\rho}{\rho-1}} + w_2^{\frac{\rho}{\rho-1}}\right]^{-\frac{1}{\rho}} y$$
$$x_2(w_1, w_2, y) = w_2^{\frac{1}{\rho-1}} \left[w_1^{\frac{\rho}{\rho-1}} + w_2^{\frac{\rho}{\rho-1}}\right]^{-\frac{1}{\rho}} y.$$

Substituting these functions into the definition of the cost function yields

$$c(w_1, w_2, y) = y \left[w_1^{\frac{\rho}{\rho-1}} + w_2^{\frac{\rho}{\rho-1}}\right]^{\frac{\rho-1}{\rho}}.$$

This expression looks a bit nicer if we set $r = \rho/(\rho - 1)$ and write

$$c(w_1, w_2, y) = y \left[w_1^r + w_2^r\right]^{\frac{1}{r}}.$$

Note that this cost function has the same form as the original CES production function
with $r$ replacing $\rho$. In the general case where

$$f(x_1, x_2) = \left[(a_1 x_1)^\rho + (a_2 x_2)^\rho\right]^{\frac{1}{\rho}},$$

similar computations can be done to show that

$$c(w_1, w_2, y) = \left[(w_1/a_1)^r + (w_2/a_2)^r\right]^{\frac{1}{r}} y.$$

Example 3.5.3 (The Cost function for the Leontief technology) Suppose $f(x_1, x_2) = \min\{a x_1, b x_2\}$. Since we know that the firm will not waste any input with a positive price,
the firm must operate at a point where $y = a x_1 = b x_2$. Hence, if the firm wants to produce
y units of output, it must use y/a units of good 1 and y/b units of good 2 no matter what
the input prices are. Hence, the cost function is given by

$$c(w_1, w_2, y) = \frac{w_1 y}{a} + \frac{w_2 y}{b} = y \left(\frac{w_1}{a} + \frac{w_2}{b}\right).$$

Example 3.5.4 (The cost function for the linear technology) Suppose that $f(x_1, x_2) = a x_1 + b x_2$, so that factors 1 and 2 are perfect substitutes. What will the cost function look
like? Since the two goods are perfect substitutes, the firm will use whichever is cheaper.
Hence, the cost function will have the form $c(w_1, w_2, y) = \min\{w_1/a, w_2/b\}\, y$.

In this case the answer to the cost-minimization problem typically involves a boundary
solution: one of the two factors will be used in a zero amount. It is easy to see the answer
to this particular problem by comparing the relative steepness of the isocost line and
isoquant curve. If $\frac{a}{b} < \frac{w_1}{w_2}$, the firm only uses $x_2$ and the cost function is given by
$c(w_1, w_2, y) = w_2 x_2 = w_2 \frac{y}{b}$. If $\frac{a}{b} > \frac{w_1}{w_2}$, the firm only uses $x_1$ and the cost function is
given by $c(w_1, w_2, y) = w_1 x_1 = w_1 \frac{y}{a}$.
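
The three cost functions of Examples 3.5.2 to 3.5.4 are easy to compare side by side. The following is a minimal sketch under our own naming; it assumes $\rho < 1$, $\rho \neq 0$ for the CES case and strictly positive prices, and the numerical values are illustrative only.

```python
def ces_demands(w1, w2, y, rho):
    """Conditional demands for f = (x1**rho + x2**rho)**(1/rho)."""
    r = rho / (rho - 1.0)
    denom = (w1 ** r + w2 ** r) ** (1.0 / rho)
    x1 = w1 ** (1.0 / (rho - 1.0)) * y / denom
    x2 = w2 ** (1.0 / (rho - 1.0)) * y / denom
    return x1, x2

def ces_cost(w1, w2, y, rho):
    """c = y * (w1**r + w2**r)**(1/r) with r = rho / (rho - 1)."""
    r = rho / (rho - 1.0)
    return y * (w1 ** r + w2 ** r) ** (1.0 / r)

def leontief_cost(w1, w2, y, a, b):
    """f = min(a*x1, b*x2): both inputs are needed in fixed proportions."""
    return y * (w1 / a + w2 / b)

def linear_cost(w1, w2, y, a, b):
    """f = a*x1 + b*x2: only the cheaper input per unit of output is used."""
    return y * min(w1 / a, w2 / b)

x1, x2 = ces_demands(2.0, 3.0, 4.0, 0.5)
print("CES output check:", (x1 ** 0.5 + x2 ** 0.5) ** 2)
print("CES cost:", ces_cost(2.0, 3.0, 4.0, 0.5), "=", 2.0 * x1 + 3.0 * x2)
print("Leontief cost:", leontief_cost(2.0, 3.0, 6.0, 2.0, 3.0))
print("Linear cost:", linear_cost(2.0, 3.0, 6.0, 2.0, 1.0))
```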
3.6 Cost Functions



The cost function measures the minimum cost of producing a given level of output for 
some fixed factor prices. As such it summarizes information about the technological 
choices available to the firms. It turns out that the behavior of the cost function can 
tell us a lot about the nature of the firm's technology. In the following we will first 
investigate the behavior of the cost function c(w, y) with respect to its price and quantity 
arguments. We then define a few related functions, namely the average and the marginal 
cost functions. 

3.6.1 Properties of Cost Functions 

You may have noticed some similarities here with consumer theory. These similarities are 
in fact exact when one compares the cost function with the expenditure function. Indeed, 
consider their definitions. 

(1) Expenditure Function: $e(p, u) = \min_{x \in \mathbb{R}^n_+} px$ such that $u(x) \geq u$

(2) Cost Function: $c(w, y) = \min_{x \in \mathbb{R}^n_+} wx$ such that $f(x) \geq y$

Mathematically, the two optimization problems are identical. Consequently, for every 
theorem we proved about expenditure functions, there is an equivalent theorem for cost 
functions. We shall state these results here, but we do not need to prove them. Their 
proofs are identical to those given for the expenditure function. 

Proposition 3.6.1 [Properties of the Cost Function.] Suppose the production function f 
is continuous and strictly increasing. Then the cost function has the following properties: 

(1) c(w,y) is nondecreasing in w. 

(2) c(w,y) is homogeneous of degree 1 in w. 

(3) c(w, y) is concave in w. 

(4) c(w, y) is continuous in w, for w > 0.

(5) For all w > 0, c(w, y) is strictly increasing in y.

(6) Shephard's lemma: If x(w, y) is the cost-minimizing bundle necessary to produce
production level y at prices w, then $x_i(w, y) = \frac{\partial c(w, y)}{\partial w_i}$ for $i = 1, \ldots, n$, assuming the
derivative exists and that $x_i > 0$.
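
Several of these properties can be spot-checked numerically. The sketch below assumes the constant-returns Cobb-Douglas cost function from Example 3.5.1 with an illustrative exponent $a = 0.3$; it uses a central finite difference to approximate the derivative in Shephard's lemma. A numerical check at a few points is of course not a proof.

```python
A_SHARE = 0.3  # illustrative Cobb-Douglas exponent (an assumption for this check)

def cost(w1, w2, y, a=A_SHARE):
    """c(w, y) = K * w1**a * w2**(1-a) * y from Example 3.5.1 (A = 1, a + b = 1)."""
    K = a ** (-a) * (1.0 - a) ** (a - 1.0)
    return K * w1 ** a * w2 ** (1.0 - a) * y

def demand_1(w1, w2, y, a=A_SHARE):
    """Conditional demand for factor 1 in the same example."""
    return ((a * w2) / ((1.0 - a) * w1)) ** (1.0 - a) * y

def shephard_1(w1, w2, y, h=1e-6):
    """Central-difference approximation of dc/dw1."""
    return (cost(w1 + h, w2, y) - cost(w1 - h, w2, y)) / (2.0 * h)

w1, w2, y, t = 2.0, 5.0, 7.0, 3.0
print("homogeneity gap:", cost(t * w1, t * w2, y) - t * cost(w1, w2, y))
print("concavity at a midpoint:",
      cost(3.0, 4.0, y) >= 0.5 * cost(2.0, 5.0, y) + 0.5 * cost(4.0, 3.0, y))
print("Shephard's lemma:", shephard_1(w1, w2, y), "vs demand", demand_1(w1, w2, y))
```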



3.6.2 Properties of Conditional Input Demand 

As solutions to the firm's cost-minimization problem, the conditional input demand
functions possess certain general properties. These are analogous to the properties of Hicksian
compensated demands, so once again it is not necessary to repeat the proof.

Proposition 3.6.2 (Negative Semi-Definite Substitution Matrix) The matrix of
substitution terms $(\partial x_j(w, y)/\partial w_i)$ is negative semi-definite.

By Shephard's lemma, the substitution matrix is the matrix of second derivatives of the
cost function, so it is symmetric; being negative semi-definite, it has non-positive
diagonal terms. We then particularly have

Proposition 3.6.3 (Symmetric Substitution Terms) The matrix of substitution terms
is symmetric, i.e.,

$$\frac{\partial x_j(w, y)}{\partial w_i} = \frac{\partial^2 c(w, y)}{\partial w_j \partial w_i} = \frac{\partial^2 c(w, y)}{\partial w_i \partial w_j} = \frac{\partial x_i(w, y)}{\partial w_j}.$$

Proposition 3.6.4 (Negative Own-Substitution Terms) The compensated own-price
effect is non-positive; that is, the input demand curves slope downward:

$$\frac{\partial x_i(w, y)}{\partial w_i} = \frac{\partial^2 c(w, y)}{\partial w_i^2} \leq 0.$$
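
As a quick illustration of Propositions 3.6.2 to 3.6.4, the substitution matrix can be estimated by finite differences. The sketch below assumes the Cobb-Douglas conditional demands of Example 3.5.1 with $A = 1$ and $b = 1 - a$; the helper names and parameter values are hypothetical.

```python
def demands(w, y, a=0.3):
    """Cobb-Douglas conditional demands (A = 1, b = 1 - a); illustrative only."""
    ratio = (a * w[1]) / ((1.0 - a) * w[0])
    return [ratio ** (1.0 - a) * y, ratio ** (-a) * y]

def substitution_matrix(w, y, h=1e-6):
    """Central-difference estimate of S[i][j] = d x_i / d w_j."""
    n = len(w)
    S = [[0.0] * n for _ in range(n)]
    for j in range(n):
        up, down = list(w), list(w)
        up[j] += h
        down[j] -= h
        x_up, x_down = demands(up, y), demands(down, y)
        for i in range(n):
            S[i][j] = (x_up[i] - x_down[i]) / (2.0 * h)
    return S

w, y = [2.0, 5.0], 7.0
S = substitution_matrix(w, y)
det = S[0][0] * S[1][1] - S[0][1] * S[1][0]
print("substitution matrix:", S)
print("symmetry gap:", S[0][1] - S[1][0])
print("own-price terms:", S[0][0], S[1][1], " determinant:", det)
```

For a symmetric 2x2 matrix, non-positive diagonal terms together with a non-negative determinant give negative semi-definiteness; here the determinant is zero up to numerical error because the demands are homogeneous of degree zero in prices.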

Remark 3.6.1 Using the cost function, we can restate the firm's profit maximization 
problem as 

$$\max_{y} \; py - c(w, y). \tag{3.11}$$

The necessary first-order condition for y* to be profit-maximizing is then

$$p - \frac{\partial c(w, y^*)}{\partial y} \leq 0, \quad \text{with equality if } y^* > 0. \tag{3.12}$$

In other words, at an interior optimum (i.e., y* > 0), price equals marginal cost. If c(w, y) 
is convex in y, then the first-order condition (3.12) is also sufficient for y* to be the firm's 
optimal output level. 
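
To see condition (3.12) at work, consider the following minimal sketch. It assumes a hypothetical convex cost function $c(y) = \kappa y^{1/s}$ with factor prices held fixed (so they are absorbed into $\kappa$), and finds the output at which price equals marginal cost by bisection.

```python
def cost(y, kappa=2.0, s=0.5):
    """Illustrative cost c(y) = kappa * y**(1/s); convex in y when s < 1."""
    return kappa * y ** (1.0 / s)

def marginal_cost(y, kappa=2.0, s=0.5):
    """Derivative of the illustrative cost function with respect to y."""
    return (kappa / s) * y ** (1.0 / s - 1.0)

def optimal_output(p, lo=0.0, hi=1e3, iters=200):
    """Bisection on p - MC(y); valid here because MC is increasing in y."""
    for _ in range(iters):
        mid = 0.5 * (lo + hi)
        if marginal_cost(mid) < p:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

p = 8.0
y_star = optimal_output(p)
print("y* =", y_star, " MC(y*) =", marginal_cost(y_star), " price =", p)
print("profit at y*:", p * y_star - cost(y_star))
```

Because the assumed cost function is convex in $y$, the first-order condition is also sufficient, so nearby output levels should yield no higher profit.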
3.6.3 Average and Marginal Costs

Let us consider the structure of the cost function. Note that the cost function can always 
be expressed simply as the value of the conditional factor demands. 

c(w,y) = wx(w,y) 

In the short run, some of the factors of production are fixed at predetermined levels.
Let $x_f$ be the vector of fixed factors, $x_v$ the vector of variable factors, and break up
$w$ into $w = (w_v, w_f)$, the vectors of prices of the variable and the fixed factors. The
short-run conditional factor demand functions will generally depend on $x_f$, so we write
them as $x_v(w, y, x_f)$. Then the short-run cost function can be written as

$$c(w, y, x_f) = w_v x_v(w, y, x_f) + w_f x_f.$$

The term $w_v x_v(w, y, x_f)$ is called short-run variable cost (SVC), and the term $w_f x_f$ is
the fixed cost (FC). We can define various derived cost concepts from these basic units:



short-run total cost = STC = $w_v x_v(w, y, x_f) + w_f x_f$

short-run average cost = SAC = $\frac{c(w, y, x_f)}{y}$

short-run average variable cost = SAVC = $\frac{w_v x_v(w, y, x_f)}{y}$

short-run average fixed cost = SAFC = $\frac{w_f x_f}{y}$

short-run marginal cost = SMC = $\frac{\partial c(w, y, x_f)}{\partial y}$.

When all factors are variable, the firm will optimize in the choice of $x_f$. Hence,
the long-run cost function only depends on the factor prices and the level of output as
indicated earlier. We can express this long-run function in terms of the short-run cost
function in the following way. Let $x_f(w, y)$ be the optimal choice of the fixed factors, and
let $x_v(w, y) = x_v(w, y, x_f(w, y))$ be the long-run optimal choice of the variable factors.
Then the long-run cost function can be written as

$$c(w, y) = w_v x_v(w, y) + w_f x_f(w, y) = c(w, y, x_f(w, y)).$$

Similarly, we can define the long-run average and marginal cost functions:

long-run average cost = LAC = $\frac{c(w, y)}{y}$

long-run marginal cost = LMC = $\frac{\partial c(w, y)}{\partial y}$.

Notice that "long-run average cost" equals "long-run average variable cost" since all costs 
are variable in the long-run; "long-run fixed costs" are zero for the same reason. 

Example 3.6.1 (The short-run Cobb-Douglas cost functions) Suppose the second 
factor in a Cobb-Douglas technology is restricted to operate at a level k. Then the cost- 
minimizing problem is 

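$$\min \; w_1 x_1 + w_2 k \quad \text{such that } y = x_1^a k^{1-a}.$$

Solving the constraint for $x_1$ as a function of y and k gives

$$x_1 = (y k^{a-1})^{\frac{1}{a}}.$$

Thus

$$c(w_1, w_2, y, k) = w_1 (y k^{a-1})^{\frac{1}{a}} + w_2 k.$$

The following variations can also be calculated:

short-run average cost = $w_1 \left(\frac{y}{k}\right)^{\frac{1-a}{a}} + \frac{w_2 k}{y}$

short-run average variable cost = $w_1 \left(\frac{y}{k}\right)^{\frac{1-a}{a}}$

short-run average fixed cost = $\frac{w_2 k}{y}$

short-run marginal cost = $\frac{w_1}{a} \left(\frac{y}{k}\right)^{\frac{1-a}{a}}$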
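
The short-run cost concepts of Example 3.6.1 can be verified against one another in code. This is a small sketch with illustrative parameter values; the function names are our own, and the marginal cost formula is compared with a finite-difference derivative of the short-run cost function.

```python
def short_run_cost(w1, w2, y, k, a):
    """c(w1, w2, y, k) when factor 2 is fixed at k and y = x1**a * k**(1-a)."""
    x1 = (y * k ** (a - 1.0)) ** (1.0 / a)
    return w1 * x1 + w2 * k

def savc(w1, y, k, a):
    """Short-run average variable cost."""
    return w1 * (y / k) ** ((1.0 - a) / a)

def safc(w2, y, k):
    """Short-run average fixed cost."""
    return w2 * k / y

def sac(w1, w2, y, k, a):
    """Short-run average cost."""
    return short_run_cost(w1, w2, y, k, a) / y

def smc(w1, y, k, a):
    """Short-run marginal cost."""
    return (w1 / a) * (y / k) ** ((1.0 - a) / a)

w1, w2, y, k, a, h = 2.0, 3.0, 5.0, 4.0, 0.4, 1e-6
numeric_mc = (short_run_cost(w1, w2, y + h, k, a)
              - short_run_cost(w1, w2, y - h, k, a)) / (2.0 * h)
print("SAC:", sac(w1, w2, y, k, a), " SAVC + SAFC:", savc(w1, y, k, a) + safc(w2, y, k))
print("SMC formula:", smc(w1, y, k, a), " finite difference:", numeric_mc)
```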

Example 3.6.2 (Constant returns to scale and the cost function) If the produc- 
tion function exhibits constant returns to scale, then it is intuitively clear that the cost 
function should exhibit costs that are linear in the level of output: if you want to produce 
twice as much output it will cost you twice as much. This intuition is verified in the 
following proposition: 
Proposition 3.6.5 (Constant returns to scale) If the production function exhibits constant
returns to scale, the cost function may be written as c(w, y) = yc(w, 1).

Proof. Let x* be a cheapest way to produce one unit of output at prices w so that 
c(w, 1) = wx*. We want to show that c(w, y) = wyx* = yc(w, 1). Notice first that yx* 
is feasible to produce y since the technology is constant returns to scale. Suppose that 
it does not minimize cost; instead let x' be the cost-minimizing bundle to produce y at 
prices w so that wx' < wyx*. Then wx'/y < wx* and x'/y can produce 1 since the 
technology is constant returns to scale. This contradicts the definition of x*. 

Thus, if the technology exhibits constant returns to scale, then the average cost, the 
average variable cost, and the marginal cost functions are all the same. 

3.6.4 The Geometry of Costs 

Let us first examine the short-run cost curves. In this case, we will write the cost function 
simply as c(y), which has two components: fixed costs and variable costs. We can therefore 
write short-run average cost as 

$$SAC = \frac{c(w, y, x_f)}{y} = \frac{w_f x_f}{y} + \frac{w_v x_v(w, y, x_f)}{y} = SAFC + SAVC.$$

As we increase output, average variable costs may initially decrease if there is some
initial region of economies of scale. However, it seems reasonable to suppose that the
variable factors required per unit of output will eventually increase by the law of diminishing
marginal returns, as depicted in Figure 3.7. Average fixed costs must of course decrease with output, as
indicated in Figure 3.7. Adding together the average variable cost curve and the average
fixed cost curve gives us the U-shaped average cost curve in Figure 3.7. The initial decrease in
average costs is due to the decrease in average fixed costs; the eventual increase in average
costs is due to the increase in average variable costs. The level of output at which the
average cost of production is minimized is sometimes known as the minimal efficient
scale.

In the long run all costs are variable costs and the appropriate long-run average cost
curve should also be U-shaped, since production usually exhibits increasing
returns to scale at low levels of production and ultimately exhibits decreasing returns to
scale.

Figure 3.7: Average variable, average fixed, and average cost curves.



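Let us now consider the marginal cost curve. What is its relationship to the average
cost curve? Since

$$\frac{d}{dy}\left(\frac{c(y)}{y}\right) = \frac{y c'(y) - c(y)}{y^2} = \frac{1}{y}\left[c'(y) - \frac{c(y)}{y}\right],$$

average cost is decreasing when marginal cost is below average cost and increasing when
marginal cost is above average cost. The same computation applies to variable cost.
Thus, the average variable cost curve is decreasing when the marginal cost curve lies
below the average variable cost curve, and it is increasing when the marginal cost curve
lies above the average variable cost curve. It follows that average cost reaches its minimum
at y* where the marginal cost curve passes through the average cost curve, i.e.,

$$c'(y^*) = \frac{c(y^*)}{y^*}.$$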
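
The relationship between marginal and average cost is easy to illustrate numerically. The sketch below assumes a hypothetical cost function $c(y) = F + y^2$ with fixed cost $F = 4$, locates the minimum of average cost by ternary search, and compares marginal and average cost there.

```python
def total_cost(y, fixed=4.0):
    """Illustrative cost function c(y) = F + y**2 (an assumption for this check)."""
    return fixed + y ** 2

def average_cost(y):
    return total_cost(y) / y

def marginal_cost(y, h=1e-6):
    """Central-difference approximation of c'(y)."""
    return (total_cost(y + h) - total_cost(y - h)) / (2.0 * h)

def min_average_cost(lo=0.01, hi=100.0, iters=200):
    """Ternary search; average cost F/y + y is convex for y > 0."""
    for _ in range(iters):
        m1 = lo + (hi - lo) / 3.0
        m2 = hi - (hi - lo) / 3.0
        if average_cost(m1) < average_cost(m2):
            hi = m2
        else:
            lo = m1
    return 0.5 * (lo + hi)

y_star = min_average_cost()
print("minimal efficient scale y* =", y_star)
print("AC(y*) =", average_cost(y_star), " MC(y*) =", marginal_cost(y_star))
print("below y*: MC < AC?", marginal_cost(1.0) < average_cost(1.0))
print("above y*: MC > AC?", marginal_cost(3.0) > average_cost(3.0))
```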



Remark 3.6.2 All of the analysis just discussed holds in both the long and the short 
run. However, if production exhibits constant returns to scale in the long run, so that 
the cost function is linear in the level of output, then average cost, average variable cost, 
and marginal cost are all equal to each other, which makes most of the relationships just 
described rather trivial. 

3.6.5 Long-Run and Short-Run Cost Curves 

Let us now consider the relationship between the long-run cost curves and the short-run 
cost curves. It is clear that the long-run cost curve must never lie above any short-run 
cost curve, since the short-run cost minimization problem is just a constrained version of 
the long-run cost minimization problem. 

Let us write the long-run cost function as c(y) = c(y, z(y)). Here we have omitted the 
factor prices since they are assumed fixed, and we let z(y) be the cost-minimizing demand
for a single fixed factor. Let y* be some given level of output, and let z* = z(y*) be the
associated long-run demand for the fixed factor. The short-run cost, c(y,z*), must be at 
least as great as the long-run cost, c(y,z(y)), for all levels of output, and the short-run 
cost will equal the long-run cost at output y*, so c(y*, z*) = c(y* , z(y*)). Hence, the long- 
and the short-run cost curves must be tangent at y*. 

This is just a geometric restatement of the envelope theorem. The slope of the long-run 
cost curve at y* is 

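$$\frac{dc(y^*, z(y^*))}{dy} = \frac{\partial c(y^*, z^*)}{\partial y} + \frac{\partial c(y^*, z^*)}{\partial z} \frac{\partial z(y^*)}{\partial y}.$$

But since z* is the optimal choice of the fixed factors at the output level y*, we must have

$$\frac{\partial c(y^*, z^*)}{\partial z} = 0.$$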

Thus, long-run marginal costs at y* equal short-run marginal costs at (y*,z*). 
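
The envelope argument can be illustrated with the short-run Cobb-Douglas cost function of Example 3.6.1. The following is a sketch under assumed parameter values ($w_1 = 2$, $w_2 = 3$, $a = 0.4$); the optimal level of the fixed factor is obtained by minimizing the short-run cost over $k$, which gives $z(y) = y\,[(1-a)w_1/(a w_2)]^a$.

```python
def short_run_cost(y, k, w1=2.0, w2=3.0, a=0.4):
    """Short-run Cobb-Douglas cost with the second factor fixed at k."""
    return w1 * (y * k ** (a - 1.0)) ** (1.0 / a) + w2 * k

def optimal_fixed_factor(y, w1=2.0, w2=3.0, a=0.4):
    """z(y): the level of k that minimizes short_run_cost(y, k)."""
    return y * ((1.0 - a) * w1 / (a * w2)) ** a

def long_run_cost(y):
    return short_run_cost(y, optimal_fixed_factor(y))

def slope(func, y, h=1e-6):
    """Central-difference derivative of a one-variable function."""
    return (func(y + h) - func(y - h)) / (2.0 * h)

y_star = 5.0
z_star = optimal_fixed_factor(y_star)
for y in (3.0, 4.0, 5.0, 6.0, 7.0):
    print(y, "short run:", short_run_cost(y, z_star), " long run:", long_run_cost(y))
print("SMC at (y*, z*):", slope(lambda y: short_run_cost(y, z_star), y_star))
print("LMC at y*:", slope(long_run_cost, y_star))
```

In the printed table the short-run cost should never fall below the long-run cost, the two should coincide at $y^*$, and the two slopes at $y^*$ should agree.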

Finally, we note that if the long- and short-run cost curves are tangent, the long- and 
short-run average cost curves must also be tangent. A typical configuration is illustrated 
in Figure 3.8. 

[Figure 3.8 plots average cost (AC) against output, with the short-run and long-run curves touching at y*.]

Figure 3.8: Long-run and short-run average cost curves. Note that the long-run and
the short-run average cost curves must be tangent, which implies that the long-run and
short-run marginal costs must be equal.




3.7 Duality in Production 

In the last section we investigated the properties of the cost function. Given any technology,
it is straightforward, at least in principle, to derive its cost function: we simply solve
the cost minimization problem.

In this section we show that this process can be reversed. Given a cost function we can 
"solve for" a technology that could have generated that cost function. This means that 
the cost function contains essentially the same information that the production function 
contains. Any concept defined in terms of the properties of the production function has 
a "dual" definition in terms of the properties of the cost function and vice versa. This 
general observation is known as the principle of duality. 

3.7.1 Recovering a Production Set from a Cost Function 

Given data $(w^t, x^t, y^t)$, define $VO(y)$ as an "outer bound" to the true input requirement
set $V(y)$:

$$VO(y) = \{x : w^t x \geq w^t x^t \text{ for all } t \text{ such that } y^t \leq y\}.$$

It is straightforward to verify that $VO(y)$ is a closed, monotonic, and convex technology.
Furthermore, it contains any technology that could have generated the data $(w^t, x^t, y^t)$
for $t = 1, \ldots, T$.

If we observe choices for many different factor prices, it seems that $VO(y)$ should
"approach" the true input requirement set in some sense. To make this precise, let the
factor prices vary over all possible price vectors $w \geq 0$. Then the natural generalization
of $VO$ becomes

$$V^*(y) = \{x : wx \geq wx(w, y) = c(w, y) \text{ for all } w \geq 0\}.$$

What is the relationship between $V^*(y)$ and the true input requirement set $V(y)$?
Of course, $V^*(y)$ clearly contains $V(y)$. In general, $V^*(y)$ will strictly contain $V(y)$. For
example, in Figure 3.9A we see that the shaded area cannot be ruled out of $V^*(y)$ since
the points in this area satisfy the condition that $wx \geq c(w, y)$.

The same is true for Figure 3.9B. The cost function can only contain information 
about the economically relevant sections of V(y), namely, those factor bundles that could 
actually be the solution to a cost minimization problem, i.e., that could actually be 
conditional factor demands. 

Figure 3.9: Relationship between $V(y)$ and $V^*(y)$. In general, $V^*(y)$ will strictly contain
$V(y)$.

However, suppose that our original technology is convex and monotonic. In this case
$V^*(y)$ will equal $V(y)$. This is because, in the convex, monotonic case, each point on
the boundary of $V(y)$ is a cost-minimizing factor demand for some price vector $w \geq 0$.
Thus, the set of points where $wx \geq c(w, y)$ for all $w \geq 0$ will precisely describe the input
requirement set. More formally:

Proposition 3.7.1 (Equality of V(y) and V*(y)) Suppose V(y) is a closed, convex, 
monotonic technology. Then V*(y) = V(y). 

Proof (Sketch) We already know that V*(y) contains V(y), so we only have to show 
that if x is in V*(y) then x must be in V(y). Suppose that x is not an element of V(y). 
Then since V(y) is a closed convex set satisfying the monotonicity hypothesis, we can 
apply a version of the separating hyperplane theorem to find a vector $w^* \geq 0$ such that
w*x < w*z for all z in V(y). Let z* be a point in V(y) that minimizes cost at the prices 
w*. Then in particular we have w*x < w*z* = c(w*,y). But then x cannot be in V*(y), 
according to the definition of V*(y). 
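
Proposition 3.7.1 can be illustrated for a convex, monotonic technology. The sketch below assumes a Cobb-Douglas technology $f(x_1, x_2) = x_1^a x_2^{1-a}$ with its cost function from Example 3.5.1, and tests membership in $V^*(y)$ on a finite grid of prices normalized so that $w_1 + w_2 = 1$ (the normalization is harmless because both sides of $wx \geq c(w, y)$ are homogeneous of degree 1 in $w$). A grid check is only an approximation of the "for all $w$" condition.

```python
def f(x1, x2, a=0.5):
    """Illustrative convex, monotonic technology (Cobb-Douglas, CRS)."""
    return x1 ** a * x2 ** (1.0 - a)

def cost(w1, w2, y, a=0.5):
    """Cost function of the technology above (Example 3.5.1 with A = 1)."""
    K = a ** (-a) * (1.0 - a) ** (a - 1.0)
    return K * w1 ** a * w2 ** (1.0 - a) * y

def in_v_star(x1, x2, y, n=2000, tol=1e-9):
    """Check w.x >= c(w, y) on a grid of normalized prices w1 + w2 = 1."""
    for i in range(n + 1):
        w1 = i / n
        w2 = 1.0 - w1
        if w1 * x1 + w2 * x2 < cost(w1, w2, y) - tol:
            return False
    return True

for bundle in [(4.0, 5.0), (4.0, 3.0), (1.0, 20.0), (2.0, 2.0)]:
    print(bundle, "in V(4):", f(*bundle) >= 4.0, " in V*(4):", in_v_star(*bundle, 4.0))
```

For each bundle tried, the two membership tests should give the same answer, as the proposition predicts.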

This proposition shows that if the original technology is convex and monotonic, then 
the cost function associated with the technology can be used to completely reconstruct 
the original technology. This is a reasonably satisfactory result in the case of convex 
and monotonic technologies, but what about less well-behaved cases? Suppose we start 
with some technology V(y), possibly non-convex. We find its cost function c(w,y) and 
then generate V*(y). We know from the above results that V*(y) will not necessarily be
equal to V(y), unless V(y) happens to have the convexity and monotonicity properties.
However, suppose we define

$$c^*(w, y) = \min \; wx \quad \text{such that } x \text{ is in } V^*(y).$$

What is the relationship between c*(w,y) and c(w, y)? 

Proposition 3.7.2 (Equality of c(w, y) and c*(w,y)) It follows from the definition of 
the functions that c*(w,y) = c(w,y). 

Proof. It is easy to see that $c^*(w, y) \leq c(w, y)$; since $V^*(y)$ always contains $V(y)$, the
minimal cost bundle in $V^*(y)$ must be at least as cheap as the minimal cost bundle in
$V(y)$. Suppose that for some prices $w'$ the cost-minimizing bundle $x'$ in $V^*(y)$ has the
property that $w'x' = c^*(w', y) < c(w', y)$. But this can't happen, since by definition of
$V^*(y)$, $w'x' \geq c(w', y)$.

This proposition shows that the cost function for the technology V(y) is the same as 
the cost function for its convexification V*(y). In this sense, the assumption of convex 
input requirement sets is not very restrictive from an economic point of view. 

Let us summarize the discussion to date: 

(1) Given a cost function we can define an input requirement set V*(y). 

(2) If the original technology is convex and monotonic, the constructed technology will 
be identical with the original technology. 

(3) If the original technology is non-convex or non-monotonic, the constructed input
requirement set will be a convexified, monotonized version of the original set, and, most
importantly, the constructed technology will have the same cost function as the
original technology.

In conclusion, the cost function of a firm summarizes all of the economically relevant 
aspects of its technology. 

Example 3.7.1 (Recovering production from a cost function) Suppose we are given
a specific cost function $c(w, y) = y w_1^a w_2^{1-a}$. How can we solve for its associated
technology? According to the derivative property,

$$x_1(w, y) = a y w_1^{a-1} w_2^{1-a} = a y \left(\frac{w_2}{w_1}\right)^{1-a}$$
$$x_2(w, y) = (1-a) y w_1^{a} w_2^{-a} = (1-a) y \left(\frac{w_2}{w_1}\right)^{-a}.$$

We want to eliminate $w_2/w_1$ from these two equations and get an equation for y in terms
of $x_1$ and $x_2$. Rearranging each equation gives

$$\frac{w_2}{w_1} = \left(\frac{x_1}{a y}\right)^{\frac{1}{1-a}}, \qquad \frac{w_2}{w_1} = \left(\frac{x_2}{(1-a) y}\right)^{-\frac{1}{a}}.$$

Setting these equal to each other and raising both sides to the $-a(1-a)$ power,

$$\frac{x_1^{-a}}{a^{-a} y^{-a}} = \frac{x_2^{1-a}}{(1-a)^{1-a} y^{1-a}},$$

or,

$$[a^a (1-a)^{1-a}]\, y = x_1^a x_2^{1-a}.$$

This is just the Cobb-Douglas technology.
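
The elimination of the price ratio can be confirmed numerically: whatever prices are used, the demands derived from the cost function should lie on the same Cobb-Douglas isoquant. The following sketch uses illustrative values of $a$ and $y$ and function names of our own.

```python
def demands_from_cost(w1, w2, y, a):
    """Derivatives of c(w, y) = y * w1**a * w2**(1-a) with respect to w1 and w2."""
    x1 = a * y * (w2 / w1) ** (1.0 - a)
    x2 = (1.0 - a) * y * (w2 / w1) ** (-a)
    return x1, x2

def implied_output(x1, x2, a):
    """Solve [a**a * (1-a)**(1-a)] * y = x1**a * x2**(1-a) for y."""
    return x1 ** a * x2 ** (1.0 - a) / (a ** a * (1.0 - a) ** (1.0 - a))

a, y = 0.3, 7.0
for w1, w2 in [(1.0, 1.0), (2.0, 5.0), (10.0, 0.5)]:
    x1, x2 = demands_from_cost(w1, w2, y, a)
    print((w1, w2), "->", (x1, x2), " implied output:", implied_output(x1, x2, a))
```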

We know that if the technology exhibited constant returns to scale, then the cost 
function would have the form c(w)y. Here we show that the reverse implication is also 
true. 

Proposition 3.7.3 (Constant returns to scale.) Let V(y) be convex and monotonic; 
then if c(w,y) can be written as yc(w), V(y) must exhibit constant returns to scale. 

Proof. Using convexity, monotonicity, and the assumed form of the cost function,
we know that

$$V(y) = V^*(y) = \{x : wx \geq y c(w) \text{ for all } w \geq 0\}.$$

We want to show that, if x is in $V^*(y)$, then tx is in $V^*(ty)$ for any t > 0. If x is in $V^*(y)$, we know
that $wx \geq y c(w)$ for all $w \geq 0$. Multiplying both sides of this inequality by t we get
$wtx \geq ty c(w)$ for all $w \geq 0$. But this says tx is in $V^*(ty)$.

3.7.2 Characterization of Cost Functions

We have seen in the last section that all cost functions are nondecreasing, homogeneous, 
concave, continuous functions of prices. The question arises: suppose that you are given
a nondecreasing, homogeneous, concave, continuous function of prices. Is it necessarily the
cost function of some technology? The answer is yes, and the following proposition shows
how to construct such a technology. 

Proposition 3.7.4 Let $\phi(w, y)$ be a differentiable function satisfying

(1) $\phi(tw, y) = t\phi(w, y)$ for all $t \geq 0$;

(2) $\phi(w, y) \geq 0$ for $w \geq 0$ and $y \geq 0$;

(3) $\phi(w', y) \geq \phi(w, y)$ for $w' \geq w$;

(4) $\phi(w, y)$ is concave in $w$.

Then $\phi(w, y)$ is the cost function for the technology defined by $V^*(y) = \{x \geq 0 :
wx \geq \phi(w, y) \text{ for all } w \geq 0\}$.

Proof. Given a $w > 0$ we define

$$x(w, y) = \left(\frac{\partial \phi(w, y)}{\partial w_1}, \ldots, \frac{\partial \phi(w, y)}{\partial w_n}\right)$$

and note that since $\phi(w, y)$ is homogeneous of degree 1 in $w$, Euler's law implies that
$\phi(w, y)$ can be written as

$$\phi(w, y) = \sum_{i=1}^{n} w_i \frac{\partial \phi(w, y)}{\partial w_i} = w x(w, y).$$

Note that the monotonicity of $\phi(w, y)$ implies $x(w, y) \geq 0$.

What we need to show is that for any given $w' \geq 0$, $x(w', y)$ actually minimizes $w'x$
over all $x$ in $V^*(y)$:

$$\phi(w', y) = w' x(w', y) \leq w' x \quad \text{for all } x \text{ in } V^*(y).$$

First, we show that $x(w', y)$ is feasible; that is, $x(w', y)$ is in $V^*(y)$. By the concavity of
$\phi(w, y)$ in $w$ we have

$$\phi(w', y) \leq \phi(w, y) + D\phi(w, y)(w' - w)$$

for all $w \geq 0$.

Using Euler's law as above, this reduces to

$$\phi(w', y) \leq w' x(w, y) \quad \text{for all } w \geq 0.$$

Since this inequality holds for every pair of price vectors $w$ and $w'$, it follows from the
definition of $V^*(y)$ that $x(w, y)$ is in $V^*(y)$ for every $w$; in particular, $x(w', y)$ is in $V^*(y)$.

Next we show that $x(w, y)$ actually minimizes $wx$ over all $x$ in $V^*(y)$. If $x$ is in $V^*(y)$,
then by definition it must satisfy

$$wx \geq \phi(w, y).$$

But by Euler's law,

$$\phi(w, y) = w x(w, y).$$

The above two expressions imply

$$wx \geq w x(w, y)$$

for all $x$ in $V^*(y)$ as required.

3.7.3 The Integrability for Cost Functions 

The proposition proved in the last subsection raises an interesting question. Suppose
you are given a set of functions $(g_i(w, y))$ that satisfy the properties of conditional factor
demand functions described in the previous sections, namely, that they are homogeneous
of degree 0 in prices and that

$$\left(\frac{\partial g_i(w, y)}{\partial w_j}\right)$$

is a symmetric negative semi-definite matrix. Are these functions necessarily factor
demand functions for some technology?

Let us try to apply the above proposition. First, we construct a candidate for a cost
function:

$$\phi(w, y) = \sum_{i=1}^{n} w_i g_i(w, y).$$

Next, we check whether it satisfies the properties required for the proposition just
proved.

1) Is $\phi(w, y)$ homogeneous of degree 1 in $w$? To check this we look at $\phi(tw, y) =
\sum_i t w_i g_i(tw, y)$. Since the functions $g_i(w, y)$ are by assumption homogeneous of
degree 0, $g_i(tw, y) = g_i(w, y)$ so that

$$\phi(tw, y) = t \sum_{i=1}^{n} w_i g_i(w, y) = t\phi(w, y).$$

2) Is $\phi(w, y) \geq 0$ for $w \geq 0$? Since $g_i(w, y) \geq 0$, the answer is clearly yes.

3) Is $\phi(w, y)$ nondecreasing in $w_i$? Using the product rule, we compute

$$\frac{\partial \phi(w, y)}{\partial w_i} = g_i(w, y) + \sum_{j=1}^{n} w_j \frac{\partial g_j(w, y)}{\partial w_i} = g_i(w, y) + \sum_{j=1}^{n} w_j \frac{\partial g_i(w, y)}{\partial w_j},$$

where the second equality uses the symmetry of the matrix of derivatives.
Since $g_i(w, y)$ is homogeneous of degree 0, the last term vanishes and $g_i(w, y)$ is
clearly greater than or equal to 0.

4) Finally, is $\phi(w, y)$ concave in $w$? To check this we differentiate $\phi(w, y)$ twice to get

$$\left(\frac{\partial^2 \phi(w, y)}{\partial w_i \partial w_j}\right) = \left(\frac{\partial g_i(w, y)}{\partial w_j}\right).$$

For concavity we want these matrices to be symmetric and negative semi-definite,
which they are by hypothesis.

Hence, the proposition proved in the last subsection applies and there is a technology
$V^*(y)$ that yields $(g_i(w, y))$ as its conditional factor demands. This means that the
properties of homogeneity and negative semi-definiteness form a complete list of the
restrictions on demand functions imposed by the model of cost-minimizing behavior.
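
The checks in steps 1) to 3) can be mimicked numerically for a concrete pair of candidate demand functions. The sketch below uses the hypothetical candidates $g_1 = y\,(w_2/(w_1+w_2))^2$ and $g_2 = y\,(w_1/(w_1+w_2))^2$, which are our own illustrative choice (they coincide with CES conditional demands for $\rho = 1/2$), and verifies homogeneity and the derivative property $\partial\phi/\partial w_i = g_i$ by finite differences.

```python
def g(w1, w2, y):
    """Candidate conditional demands (illustrative choice, homogeneous of degree 0)."""
    total = w1 + w2
    return (y * (w2 / total) ** 2, y * (w1 / total) ** 2)

def phi(w1, w2, y):
    """Candidate cost function phi(w, y) = sum_i w_i * g_i(w, y)."""
    g1, g2 = g(w1, w2, y)
    return w1 * g1 + w2 * g2

def gradient(w1, w2, y, h=1e-6):
    """Central-difference gradient of phi with respect to prices."""
    d1 = (phi(w1 + h, w2, y) - phi(w1 - h, w2, y)) / (2.0 * h)
    d2 = (phi(w1, w2 + h, y) - phi(w1, w2 - h, y)) / (2.0 * h)
    return d1, d2

w1, w2, y, t = 2.0, 3.0, 4.0, 5.0
print("g(w) and g(tw):", g(w1, w2, y), g(t * w1, t * w2, y))
print("phi(tw) - t*phi(w):", phi(t * w1, t * w2, y) - t * phi(w1, w2, y))
print("gradient of phi:", gradient(w1, w2, y), " demands:", g(w1, w2, y))
```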
Chapter 4

Choice Under Uncertainty 



4.1 Introduction 

Until now, we have been concerned with the behavior of a consumer under conditions 
of certainty. However, many choices made by consumers take place under conditions of 
uncertainty. In this chapter we explore how the theory of consumer choice can be used to 
describe such behavior. 

The broad outline of this chapter parallels a standard presentation of microeconomic
theory for deterministic situations. It first considers the problem of an individual
consumer facing an uncertain environment. It shows how preference structures can be
extended to uncertain situations and describes the nature of the consumer choice problem.
We then proceed to derive the expected utility theorem, a result of central importance.
In the remaining sections, we discuss the concept of risk aversion, and extend the basic 
theory by allowing utility to depend on states of nature underlying the uncertainty as well 
as on the monetary payoffs. We also discuss the theory of subjective probability, which 
offers a way of modelling choice under uncertainty in which the probabilities of different 
risky alternatives are not given to the decision maker in any objective fashion. 
4.2 Expected Utility Theory



4.2.1 Lotteries 

The first task is to describe the set of choices facing the consumer. We shall imagine that
the choices facing the consumer take the form of lotteries. Suppose there are $S$ states.
Associated with each state $s$ is a probability $p_s$ representing the probability that the state
$s$ will occur and a commodity bundle $x_s$ representing the prize or reward that will be won
if the state $s$ occurs, where we have $p_s \geq 0$ and $\sum_{s=1}^{S} p_s = 1$. The prizes may be money,
bundles of goods, or even further lotteries. A lottery is denoted by

$$p_1 \circ x_1 \oplus p_2 \circ x_2 \oplus \cdots \oplus p_S \circ x_S.$$

For instance, for two states, a lottery is given by $p \circ x \oplus (1 - p) \circ y$, which means: "the
consumer receives prize x with probability p and prize y with probability (1 - p)." Most
situations involving behavior under risk can be put into this lottery framework. Lotteries
are often represented graphically by a fan of possibilities as in Figure 4.1 below.

Figure 4.1: A lottery.

A compound lottery is shown in Figure 4.2. This lottery is between two prizes: a
lottery between x and y, and a bundle z.

Figure 4.2: Compound lottery. (a) Original; (b) Equivalent.

We will impose several axioms on the consumer's perception of the lotteries open to
him.

A1 (Certainty). $1 \circ x \oplus (1 - 1) \circ y \sim x$. Getting a prize with probability one is
equivalent to that prize.

A2 (Independence of Order). $p \circ x \oplus (1 - p) \circ y \sim (1 - p) \circ y \oplus p \circ x$. The consumer
doesn't care about the order in which the lottery is described; only the prizes and the
probabilities of winning those prizes matter.

A3 (Compounding). $q \circ (p \circ x \oplus (1 - p) \circ y) \oplus (1 - q) \circ y \sim (qp) \circ x \oplus (1 - qp) \circ y$. It is only
the net probabilities of receiving a reward that matter. This is a fundamental axiom
used to reduce compound lotteries by determining the overall probabilities associated
with their components. This axiom is sometimes called "reduction of compound lotteries."

Under these assumptions we can define $\mathcal{L}$, the space of lotteries available to the
consumer. The consumer is assumed to have preferences on this lottery space: given any
two lotteries, he can choose between them. As usual we will assume the preferences are
complete, reflexive, and transitive, so that they form a preference ordering.

The fact that lotteries have only two outcomes is not restrictive since we have allowed 
the outcomes to be further lotteries. This allows us to construct lotteries with arbitrary 
numbers of prizes by compounding two prize lotteries as shown in Figure 4.2. For example, 
suppose we want to represent a situation with three prizes x, y and z where the probability 
of getting each prize is one third. By the reduction of compound lotteries, this lottery is 
equivalent to the lottery 



$$\frac{2}{3} \circ \left[\frac{1}{2} \circ x \oplus \frac{1}{2} \circ y\right] \oplus \frac{1}{3} \circ z.$$



According to the compounding axiom (A3) above, the consumer only cares about the net 
probabilities involved, so this is indeed equivalent to the original lottery. 
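The reduction of this compound lottery to net probabilities can be checked mechanically. The following is a minimal sketch, assuming a lottery is encoded as a list of (probability, prize) pairs in which a prize is either a label or another such list; the function name `reduce_lottery` and this encoding are illustrative choices, not notation from the text.

```python
from fractions import Fraction


def reduce_lottery(lottery):
    """Flatten a (possibly compound) lottery into net probabilities per prize.

    A lottery is a list of (probability, prize) pairs; a prize is either a
    label (str) or another lottery (list). This applies the compounding
    axiom (A3) recursively.
    """
    net = {}
    for prob, prize in lottery:
        if isinstance(prize, list):  # the prize is itself a lottery
            for label, q in reduce_lottery(prize).items():
                net[label] = net.get(label, Fraction(0)) + prob * q
        else:
            net[prize] = net.get(prize, Fraction(0)) + prob
    return net


half, third = Fraction(1, 2), Fraction(1, 3)
inner = [(half, "x"), (half, "y")]            # 1/2 o x (+) 1/2 o y
compound = [(2 * third, inner), (third, "z")]  # 2/3 o [inner] (+) 1/3 o z
net = reduce_lottery(compound)
print(net)
```

Exact fractions are used so that the net probabilities can be compared without rounding error.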

4.2.2 Expected Utility 

Under minor additional assumptions, the theorem concerning the existence of a utility
function may be applied to show that there exists a continuous utility function $u$ which
describes the consumer's preferences; that is, $p \circ x \oplus (1 - p) \circ y \succ q \circ w \oplus (1 - q) \circ z$ if
and only if

$$u(p \circ x \oplus (1 - p) \circ y) > u(q \circ w \oplus (1 - q) \circ z).$$

Of course, this utility function is not unique; any monotonic transform would do as well.
Under some additional hypotheses, we can find a particular monotonic transformation of
the utility function that has a very convenient property, the expected utility property:

$$u(p \circ x \oplus (1 - p) \circ y) = p u(x) + (1 - p) u(y).$$

The expected utility property says that the utility of a lottery is the expectation of
the utility from its prizes. Such an expected utility function is called a von Neumann-Morgenstern
utility function. To have a utility function with this convenient
property, we need the following additional axioms:

A4 (Continuity). $\{p \in [0, 1] : p \circ x \oplus (1 - p) \circ y \succeq z\}$ and $\{p \in [0, 1] : z \succeq p \circ x \oplus (1 - p) \circ y\}$
are closed sets for all $x$, $y$ and $z$ in $\mathcal{L}$. Axiom A4 states that preferences
are continuous with respect to probabilities.

A5 (Strong Independence). $x \sim y$ implies $p \circ x \oplus (1 - p) \circ z \sim p \circ y \oplus (1 - p) \circ z$.
It says that lotteries with indifferent prizes are indifferent.

In order to avoid some technical details we will make two further assumptions.

A6 (Boundedness). There is some best lottery $b$ and some worst lottery $w$. For any
$x$ in $\mathcal{L}$, $b \succeq x \succeq w$.

A7 (Monotonicity). A lottery $p \circ b \oplus (1 - p) \circ w$ is preferred to $q \circ b \oplus (1 - q) \circ w$ if
and only if $p > q$.

Axiom A7 can be derived from the other axioms. It just says that if one lottery
between the best prize and the worst prize is preferred to another, it must be because it
gives a higher probability of getting the best prize.

Under these assumptions we can state the main theorem. 

Theorem 4.2.1 (Expected utility theorem) If $(\mathcal{L}, \succeq)$ satisfies Axioms A1-A7, there is a
utility function $u$ defined on $\mathcal{L}$ that satisfies the expected utility property:

$$u(p \circ x \oplus (1 - p) \circ y) = p u(x) + (1 - p) u(y).$$

Proof. Define $u(b) = 1$ and $u(w) = 0$. To find the utility of an arbitrary lottery $z$, set
$u(z) = p_z$ where $p_z$ is defined by

$$p_z \circ b \oplus (1 - p_z) \circ w \sim z. \qquad (4.1)$$

In this construction the consumer is indifferent between $z$ and a gamble between the best
and the worst outcomes that gives probability $p_z$ of the best outcome.
To ensure that this is well defined, we have to check two things.

(1) Does $p_z$ exist? The two sets $\{p \in [0, 1] : p \circ b \oplus (1 - p) \circ w \succeq z\}$ and
$\{p \in [0, 1] : z \succeq p \circ b \oplus (1 - p) \circ w\}$ are closed and nonempty by the continuity and boundedness
axioms (A4 and A6), and every point in $[0, 1]$ is in one or the other of the two sets. Since
the unit interval is connected, there must be some $p$ in both, and this is just the
desired $p_z$.

(2) Is $p_z$ unique? Suppose $p_z$ and $p'_z$ are two distinct numbers and that each satisfies
(4.1). Then one must be larger than the other. By the monotonicity axiom A7, the lottery
that gives a bigger probability of getting the best prize cannot be indifferent to one that
gives a smaller probability. Hence, $p_z$ is unique and $u$ is well defined.

We next check that $u$ has the expected utility property. This follows from some simple
substitutions:

$$
\begin{aligned}
p \circ x \oplus (1 - p) \circ y
&\sim_1 p \circ [p_x \circ b \oplus (1 - p_x) \circ w] \oplus (1 - p) \circ [p_y \circ b \oplus (1 - p_y) \circ w] \\
&\sim_2 [p p_x + (1 - p) p_y] \circ b \oplus [1 - p p_x - (1 - p) p_y] \circ w \\
&\sim_3 [p u(x) + (1 - p) u(y)] \circ b \oplus [1 - p u(x) - (1 - p) u(y)] \circ w.
\end{aligned}
$$

Substitution 1 uses the strong independence axiom (A5) and the definition of $p_x$ and $p_y$.
Substitution 2 uses the compounding axiom (A3), which says only the net probabilities
of obtaining $b$ or $w$ matter. Substitution 3 uses the construction of the utility function.
It follows from the construction of the utility function that

$$u(p \circ x \oplus (1 - p) \circ y) = p u(x) + (1 - p) u(y).$$

Finally, we verify that $u$ is a utility function. Suppose that $x \succ y$. Then

$$u(x) = p_x \text{ such that } x \sim p_x \circ b \oplus (1 - p_x) \circ w,$$
$$u(y) = p_y \text{ such that } y \sim p_y \circ b \oplus (1 - p_y) \circ w.$$

By the monotonicity axiom (A7), we must have $u(x) > u(y)$.
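The construction used in the proof can be imitated numerically. The sketch below is only an illustration under stated assumptions: prizes are amounts of money between a hypothetical worst prize 0 and best prize 100, and the consumer's answers to pairwise comparisons are simulated by a hidden square-root value function. The names `hidden_value`, `prefers_mix`, and `constructed_utility` are invented for this example. Bisection on $p$ is justified by the monotonicity axiom (A7).

```python
import math

BEST, WORST = 100.0, 0.0  # hypothetical best and worst money prizes


def hidden_value(x):
    """Stand-in for the consumer's tastes; used only to answer comparisons."""
    return math.sqrt(x)


def prefers_mix(p, z):
    """True if the consumer weakly prefers p o BEST (+) (1-p) o WORST to prize z."""
    mix = p * hidden_value(BEST) + (1 - p) * hidden_value(WORST)
    return mix >= hidden_value(z)


def constructed_utility(z, tol=1e-10):
    """Find p_z with p_z o BEST (+) (1 - p_z) o WORST ~ z by bisection on p."""
    lo, hi = 0.0, 1.0
    while hi - lo > tol:
        mid = (lo + hi) / 2
        if prefers_mix(mid, z):
            hi = mid  # the mix is at least as good as z: p_z is no larger
        else:
            lo = mid  # the mix is worse than z: p_z is larger
    return (lo + hi) / 2


u25 = constructed_utility(25.0)
u_b = constructed_utility(BEST)
u_w = constructed_utility(WORST)
print(u25, u_b, u_w)
```

The recovered numbers are the hidden values rescaled so that the best prize has utility 1 and the worst has utility 0, which matches the normalization $u(b) = 1$, $u(w) = 0$ in the proof.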

4.2.3 Uniqueness of the Expected Utility Function 

We have shown that there exists an expected utility function $u : \mathcal{L} \to \mathbb{R}$. Of course, any
monotonic transformation of $u$ will also be a utility function that describes the consumer's
choice behavior. But will such a monotonic transform preserve the expected utility property?
Does the construction described above characterize expected utility functions in
any way?

It is not hard to see that, if $u(\cdot)$ is an expected utility function describing some
consumer, then so is $v(\cdot) = a u(\cdot) + c$ where $a > 0$; that is, any positive affine transformation of an
expected utility function is also an expected utility function. This is clear since

$$
\begin{aligned}
v(p \circ x \oplus (1 - p) \circ y) &= a u(p \circ x \oplus (1 - p) \circ y) + c \\
&= a[p u(x) + (1 - p) u(y)] + c \\
&= p v(x) + (1 - p) v(y).
\end{aligned}
$$
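A small numerical check may make the role of affine transformations concrete. The sketch below assumes a square-root utility over money and two hypothetical lotteries; it compares the ranking produced by taking expectations of $u$, of a positive affine transform of $u$, and of a monotone but non-affine transform of $u$. All names are illustrative.

```python
import math


def expected(lottery, utility):
    """Expectation of utility over a list of (probability, prize) pairs."""
    return sum(p * utility(x) for p, x in lottery)


def u(x):
    return math.sqrt(x)


def v(x):
    return 2.0 * u(x) + 3.0  # positive affine transform of u


def g(x):
    return u(x) ** 2  # increasing in u (for u >= 0) but not affine


gamble = [(0.5, 0.0), (0.5, 100.0)]  # 1/2 o 0 (+) 1/2 o 100
sure = [(1.0, 36.0)]                 # 36 for certain

prefers_sure_u = expected(sure, u) > expected(gamble, u)
prefers_sure_v = expected(sure, v) > expected(gamble, v)
prefers_sure_g = expected(sure, g) > expected(gamble, g)
print(prefers_sure_u, prefers_sure_v, prefers_sure_g)
```

Under $u$ and $v$ the sure prize is ranked above the gamble, while taking expectations of $g$ reverses the ranking, so $g$ represents the same preferences over sure prizes but does not have the expected utility property for this consumer.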

It is not much harder to see the converse: that any monotonic transform of u that has 
the expected utility property must be an affine transform. Stated another way: 

Theorem 4.2.2 (Uniqueness of expected utility function) An expected utility function
is unique up to an affine transformation.

Proof. According to the above remarks we only have to show that, if a monotonic
transformation preserves the expected utility property, it must be an affine transformation.
Let $f : \mathbb{R} \to \mathbb{R}$ be a monotonic transformation of $u$ that has the expected utility property.
Then

$$f(u(p \circ x \oplus (1 - p) \circ y)) = p f(u(x)) + (1 - p) f(u(y)),$$

or

$$f(p u(x) + (1 - p) u(y)) = p f(u(x)) + (1 - p) f(u(y)).$$

But this is equivalent to the definition of an affine transformation.

4.2.4 Other Notations for Expected Utility

We have proved the expected utility theorem for the case where there are two outcomes
to the lotteries. As indicated earlier, it is straightforward to extend this proof to the case
of a finite number of outcomes by using compound lotteries. If outcome $x_i$ is received
with probability $p_i$ for $i = 1, \ldots, n$, the expected utility of this lottery is simply

$$\sum_{i=1}^{n} p_i u(x_i). \qquad (4.2)$$

Subject to some minor technical details, the expected utility theorem also holds for
continuous probability distributions. If $p(x)$ is a probability density function defined on
outcomes $x$, then the expected utility of this gamble can be written as

$$\int u(x) p(x) \, dx. \qquad (4.3)$$

We can subsume both of these cases by using the expectation operator. Let $X$ be a random
variable that takes on values denoted by $x$. Then the utility of $X$, $u(X)$, is also a random
variable. The expectation of this random variable, $Eu(X)$, is simply the expected
utility associated with the lottery $X$. In the case of a discrete random variable, $Eu(X)$ is
given by (4.2), and in the case of a continuous random variable $Eu(X)$ is given by (4.3).

4.3 Risk Aversion

4.3.1 Absolute Risk Aversion

Let us consider the case where the lottery space consists solely of gambles with money
prizes. We have shown that if the consumer's choice behavior satisfies Axioms A1-A7, we can
use an expected utility function to represent the consumer's preferences on lotteries. This
means that we can describe the consumer's behavior over all money gambles by an expected
utility function. For example, to compute the consumer's expected utility of a gamble
$p \circ x \oplus (1 - p) \circ y$, we just look at $p u(x) + (1 - p) u(y)$.

This construction is illustrated in Figure 4.3 for $p = 1/2$. Notice that in this example
the consumer prefers to get the expected value of the lottery. That is, the utility of the
lottery $u(p \circ x \oplus (1 - p) \circ y)$ is less than the utility of the expected value of the lottery,
$px + (1 - p)y$. Such behavior is called risk aversion. A consumer may also be risk loving;
in such a case, the consumer prefers a lottery to its expected value.

Figure 4.3: Expected utility of a gamble. Axes: wealth (horizontal), utility (vertical); marked utility levels: $u(\frac{1}{2}x + \frac{1}{2}y)$ and $\frac{1}{2}u(x) + \frac{1}{2}u(y)$.



If a consumer is risk averse over some region, the chord drawn between any two points 
of the graph of his utility function in this region must lie below the function. This is 
equivalent to the mathematical definition of a concave function. Hence, concavity of 
the expected utility function is equivalent to risk aversion. 
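The link between curvature and attitude toward risk can be illustrated with a short computation. This is a minimal sketch, assuming a 50-50 gamble between wealth levels 4 and 16 and two illustrative utility functions (a concave square root and a convex square); the helper names are not from the text.

```python
import math


def expected_utility(u, p, x, y):
    """Expected utility of the gamble p o x (+) (1 - p) o y."""
    return p * u(x) + (1 - p) * u(y)


def attitude(u, p, x, y):
    """Compare the gamble with receiving its expected value for sure."""
    eu = expected_utility(u, p, x, y)
    utility_of_mean = u(p * x + (1 - p) * y)
    if eu < utility_of_mean:
        return "risk averse"
    if eu > utility_of_mean:
        return "risk loving"
    return "indifferent"


def square(w):
    return w ** 2


results = {
    "sqrt": attitude(math.sqrt, 0.5, 4.0, 16.0),
    "square": attitude(square, 0.5, 4.0, 16.0),
}
print(results)
```

With the concave function the chord value $\frac{1}{2}u(4) + \frac{1}{2}u(16) = 3$ lies below $u(10)$, while with the convex function the chord value lies above it.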

It is often convenient to have a measure of risk aversion. Intuitively, the more concave 
the expected utility function, the more risk averse the consumer. Thus, we might think 
we could measure risk aversion by the second derivative of the expected utility function. 
However, this definition is not invariant to changes in the expected utility function: if we 
multiply the expected utility function by 2, the consumer's behavior doesn't change, but 
our proposed measure of risk aversion does. However, if we normalize the second derivative 
by dividing by the first, we get a reasonable measure, known as the Arrow-Pratt measure 
of (absolute) risk aversion:

$$r(w) = -\frac{u''(w)}{u'(w)}.$$

Example 4.3.1 (Constant risk aversion) Suppose an individual has a constant risk
aversion coefficient $r$. Then the utility function satisfies

$$u''(x) = -r u'(x).$$

One can easily check that all solutions are

$$u(x) = -a e^{-rx} + b,$$

where $a$ and $b$ are arbitrary. For $u(x)$ to be increasing in $x$ when $r > 0$, we must take $a > 0$.
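The Arrow-Pratt measure can be approximated numerically for any smooth utility function. The sketch below uses central finite differences, which is an assumption of this illustration rather than a method from the text, and applies it to $u(x) = -e^{-rx}$ with a hypothetical $r = 0.5$ and to a rescaled version of the same function.

```python
import math


def arrow_pratt(u, w, h=1e-3):
    """Approximate r(w) = -u''(w)/u'(w) with central finite differences."""
    d1 = (u(w + h) - u(w - h)) / (2 * h)
    d2 = (u(w + h) - 2 * u(w) + u(w - h)) / (h * h)
    return -d2 / d1


r = 0.5  # hypothetical risk aversion coefficient


def cara(x):
    return -math.exp(-r * x)


def scaled(x):
    return 2.0 * cara(x) + 7.0  # same preferences, different scale


r_at_1 = arrow_pratt(cara, 1.0)
r_at_5 = arrow_pratt(cara, 5.0)
r_scaled = arrow_pratt(scaled, 1.0)
print(r_at_1, r_at_5, r_scaled)
```

All three values are close to 0.5: the measure is constant in wealth for this utility function and is unaffected by the affine rescaling, unlike the raw second derivative.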



Example 4.3.2 A woman with current wealth $X$ has the opportunity to bet any amount
on the occurrence of an event that she knows will occur with probability $p$. If she wagers
$w$, she will receive $2w$ if the event occurs and $0$ if it does not. She has a constant risk
aversion utility function $u(x) = -e^{-rx}$ with $r > 0$. How much should she wager?
Her final wealth will be either $X + w$ or $X - w$. Hence she solves

$$\max_{w} \{p u(X + w) + (1 - p) u(X - w)\} = \max_{w} \{-p e^{-r(X + w)} - (1 - p) e^{-r(X - w)}\}.$$

Setting the derivative to zero yields

$$(1 - p) e^{rw} = p e^{-rw}.$$

Hence,

$$w = \frac{1}{2r} \ln \frac{p}{1 - p}.$$

Note that a positive wager will be made for $p > 1/2$. The wager decreases as the risk
aversion coefficient increases. Note also that in this case the result is independent of the initial
wealth, a particular feature of this utility function.
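The closed-form wager can be compared with a direct numerical maximization. The sketch below assumes hypothetical values $p = 0.7$ and $r = 0.2$ and uses a ternary search, which is valid here because the objective is concave in $w$; the search interval and function names are choices made for this illustration.

```python
import math


def expected_utility(w, X, p, r):
    """p*u(X + w) + (1 - p)*u(X - w) with u(x) = -exp(-r*x)."""
    return -p * math.exp(-r * (X + w)) - (1 - p) * math.exp(-r * (X - w))


def best_wager(X, p, r, lo=-20.0, hi=20.0, iters=200):
    """Ternary search for the maximizer of a concave function of w."""
    for _ in range(iters):
        m1 = lo + (hi - lo) / 3
        m2 = hi - (hi - lo) / 3
        if expected_utility(m1, X, p, r) < expected_utility(m2, X, p, r):
            lo = m1  # the maximizer lies to the right of m1
        else:
            hi = m2  # the maximizer lies to the left of m2
    return (lo + hi) / 2


p, r = 0.7, 0.2
closed_form = math.log(p / (1 - p)) / (2 * r)
numeric_poor = best_wager(X=10.0, p=p, r=r)
numeric_rich = best_wager(X=1000.0, p=p, r=r)
print(closed_form, numeric_poor, numeric_rich)
```

The two numerical wagers agree with each other and with the formula, which illustrates that the optimal bet does not depend on initial wealth for this utility function.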

Example 4.3.3 (The demand for insurance) Suppose a consumer initially has monetary
wealth $W$. There is some probability $p$ that he will lose an amount $L$; for example,
there is some probability his house will burn down. The consumer can purchase insurance
that will pay him $q$ dollars in the event that he incurs this loss. The amount of money
that he has to pay for $q$ dollars of insurance coverage is $\pi q$; here $\pi$ is the premium per
dollar of coverage.

How much coverage will the consumer purchase? We look at the utility maximization
problem

$$\max_{q} \; p u(W - L - \pi q + q) + (1 - p) u(W - \pi q).$$

Taking the derivative with respect to $q$ and setting it equal to zero, we find

$$p u'(W - L + q^*(1 - \pi))(1 - \pi) - (1 - p) u'(W - \pi q^*) \pi = 0,$$

$$\frac{u'(W - L + (1 - \pi) q^*)}{u'(W - \pi q^*)} = \frac{(1 - p)}{p} \frac{\pi}{1 - \pi}.$$

If the event occurs, the insurance company receives $\pi q - q$ dollars. If the event doesn't
occur, the insurance company receives $\pi q$ dollars. Hence, the expected profit of the
company is

$$(1 - p) \pi q - p (1 - \pi) q.$$

Let us suppose that competition in the insurance industry forces these profits to zero.
This means that

$$-p(1 - \pi) q + (1 - p) \pi q = 0,$$

from which it follows that $\pi = p$.

Under the zero-profit assumption the insurance firm charges an actuarially fair premium:
the cost of a policy is precisely its expected value, so that $p = \pi$. Inserting this into
the first-order conditions for utility maximization, we find

$$u'(W - L + (1 - \pi) q^*) = u'(W - \pi q^*).$$

If the consumer is strictly risk averse so that $u''(W) < 0$, then the above equation implies

$$W - L + (1 - \pi) q^* = W - \pi q^*,$$

from which it follows that $L = q^*$. Thus, the consumer will completely insure himself
against the loss $L$.

This result depends crucially on the assumption that the consumer cannot influence 
the probability of loss. If the consumer's actions do affect the probability of loss, the 
insurance firms may only want to offer partial insurance, so that the consumer will still 
have an incentive to be careful. 
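The full-insurance result at an actuarially fair premium can be checked numerically. The sketch below assumes hypothetical values $W = 100$, $L = 40$, $p = 0.1$ and two strictly concave utility functions (logarithm and square root); the ternary search and all names are choices made for this illustration.

```python
import math


def expected_utility(q, u, W, L, p, premium):
    """p*u(W - L - premium*q + q) + (1 - p)*u(W - premium*q)."""
    return p * u(W - L - premium * q + q) + (1 - p) * u(W - premium * q)


def best_coverage(u, W, L, p, premium, iters=200):
    """Ternary search over q in [0, 2L]; the objective is concave in q."""
    lo, hi = 0.0, 2.0 * L
    for _ in range(iters):
        m1 = lo + (hi - lo) / 3
        m2 = hi - (hi - lo) / 3
        left = expected_utility(m1, u, W, L, p, premium)
        right = expected_utility(m2, u, W, L, p, premium)
        if left < right:
            lo = m1
        else:
            hi = m2
    return (lo + hi) / 2


W, L, p = 100.0, 40.0, 0.1
q_log = best_coverage(math.log, W, L, p, premium=p)    # fair premium: pi = p
q_sqrt = best_coverage(math.sqrt, W, L, p, premium=p)  # fair premium: pi = p
print(q_log, q_sqrt)
```

Both utility functions lead to coverage close to $L = 40$, in line with the argument that the conclusion $q^* = L$ relies only on strict concavity and on $\pi = p$.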

4.3.2 Global Risk Aversion 

The Arrow-Pratt measure seems to be a sensible interpretation of local risk aversion: one 
agent is more risk averse than another if he is willing to accept fewer small gambles. 
However, in many circumstances we want a global measure of risk aversion; that is, we
want to say that one agent is more risk averse than another for all levels of wealth. What
are natural ways to express this condition?

The first plausible way to formalize the notion that an agent with utility function
$A(w)$ is more risk averse than an agent with utility function $B(w)$ is to require that

$$-\frac{A''(w)}{A'(w)} > -\frac{B''(w)}{B'(w)}$$

for all levels of wealth $w$. This simply means that agent A has a higher degree of risk
aversion than agent B everywhere.

Another sensible way to formalize the notion that agent A is more risk averse than
agent B is to say that agent A's utility function is "more concave" than agent B's. More
precisely, we say that agent A's utility function is a concave transformation of agent B's;
that is, there exists some increasing, strictly concave function $G(\cdot)$ such that

$$A(w) = G(B(w)).$$

A third way to capture the idea that A is more risk averse than B is to say that A
would be willing to pay more to avoid a given risk than B would. In order to formalize
this idea, let $\tilde{\epsilon}$ be a random variable with expectation of zero: $E\tilde{\epsilon} = 0$. Then define $\pi_A(\tilde{\epsilon})$
to be the maximum amount of wealth that person A would give up in order to avoid
facing the random variable $\tilde{\epsilon}$. In symbols, this risk premium is defined by

$$A(w - \pi_A(\tilde{\epsilon})) = EA(w + \tilde{\epsilon}).$$

The left-hand side of this expression is the utility from having wealth reduced by $\pi_A(\tilde{\epsilon})$
and the right-hand side is the expected utility from facing the gamble $\tilde{\epsilon}$. It is natural to
say that person A is (globally) more risk averse than person B if $\pi_A(\tilde{\epsilon}) > \pi_B(\tilde{\epsilon})$ for all $\tilde{\epsilon}$
and $w$.

It may seem difficult to choose among these three plausible sounding interpretations of 
what it might mean for one agent to be "globally more risk averse" than another. Luckily, 
it is not necessary to do so: all three definitions turn out to be equivalent! As one step 
in the demonstration of this fact we need the following result, which is of great use in 
dealing with expected utility functions. 

Lemma 4.3.1 (Jensen's inequality) Let $X$ be a nondegenerate random variable and
$f(X)$ be a strictly concave function of this random variable. Then $Ef(X) < f(EX)$.

Proof. This is true in general, but is easiest to prove in the case of a differentiable
concave function. Such a function has the property that at any point $\bar{x}$,
$f(x) < f(\bar{x}) + f'(\bar{x})(x - \bar{x})$ for all $x \neq \bar{x}$. Let $\bar{X}$ be the expected value of $X$, set $\bar{x} = \bar{X}$,
and take expectations of each side of this expression. We have

$$Ef(X) < f(\bar{X}) + f'(\bar{X}) E(X - \bar{X}) = f(\bar{X}),$$

from which it follows that

$$Ef(X) < f(\bar{X}) = f(EX).$$

Theorem 4.3.1 (Pratt's theorem) Let $A(w)$ and $B(w)$ be two differentiable, increasing
and concave expected utility functions of wealth. Then the following properties are
equivalent.

1. $-A''(w)/A'(w) > -B''(w)/B'(w)$ for all $w$.

2. $A(w) = G(B(w))$ for some increasing strictly concave function $G$.

3. $\pi_A(\tilde{\epsilon}) > \pi_B(\tilde{\epsilon})$ for all random variables $\tilde{\epsilon}$ with $E\tilde{\epsilon} = 0$.

Proof

(1) implies (2). Define $G(B)$ implicitly by $A(w) = G(B(w))$. Note that monotonicity
of the utility functions implies that $G$ is well defined, i.e., that there is a unique value of
$G(B)$ for each value of $B$. Now differentiate this definition twice to find

$$A'(w) = G'(B) B'(w),$$
$$A''(w) = G''(B) B'(w)^2 + G'(B) B''(w).$$

Since $A'(w) > 0$ and $B'(w) > 0$, the first equation establishes $G'(B) > 0$. Dividing
the second equation by the first gives us

$$\frac{A''(w)}{A'(w)} = \frac{G''(B)}{G'(B)} B'(w) + \frac{B''(w)}{B'(w)}.$$

Rearranging gives us

$$\frac{G''(B)}{G'(B)} B'(w) = \frac{A''(w)}{A'(w)} - \frac{B''(w)}{B'(w)} < 0,$$

where the inequality follows from (1). This shows that $G''(B) < 0$, as required.
(2) implies (3). This follows from the following chain of inequalities:

$$
\begin{aligned}
A(w - \pi_A) &= EA(w + \tilde{\epsilon}) = EG(B(w + \tilde{\epsilon})) \\
&< G(EB(w + \tilde{\epsilon})) = G(B(w - \pi_B)) \\
&= A(w - \pi_B).
\end{aligned}
$$

All of these relationships follow from the definition of the risk premium except for the
inequality, which follows from Jensen's inequality. Comparing the first and the last terms,
we see that $\pi_A > \pi_B$.

(3) implies (1). Since (3) holds for all zero-mean random variables $\tilde{\epsilon}$, it must hold for
arbitrarily small random variables. Fix an $\tilde{\epsilon}$, and consider the family of random variables
defined by $t\tilde{\epsilon}$ for $t$ in $[0, 1]$. Let $\pi(t)$ be the risk premium as a function of $t$. The second-order
Taylor series expansion of $\pi(t)$ around $t = 0$ is given by

$$\pi(t) \approx \pi(0) + \pi'(0) t + \frac{1}{2} \pi''(0) t^2. \qquad (4.4)$$

We will calculate the terms in this Taylor series in order to see how $\pi(t)$ behaves for small
$t$. The definition of $\pi(t)$ is

$$A(w - \pi(t)) = EA(w + t\tilde{\epsilon}).$$

It follows from this definition that $\pi(0) = 0$. Differentiating the definition twice with
respect to $t$ gives us

$$-A'(w - \pi(t)) \pi'(t) = E[A'(w + t\tilde{\epsilon}) \tilde{\epsilon}],$$
$$A''(w - \pi(t)) \pi'(t)^2 - A'(w - \pi(t)) \pi''(t) = E[A''(w + t\tilde{\epsilon}) \tilde{\epsilon}^2].$$

Evaluating the first expression when $t = 0$, we see that $\pi'(0) = 0$. Evaluating the
second expression when $t = 0$, we see that

$$\pi''(0) = -\frac{EA''(w) \tilde{\epsilon}^2}{A'(w)} = -\frac{A''(w)}{A'(w)} \sigma^2,$$

where $\sigma^2$ is the variance of $\tilde{\epsilon}$. Plugging the derivatives into equation (4.4) for $\pi(t)$, we
have

$$\pi(t) \approx 0 + 0 - \frac{A''(w)}{A'(w)} \frac{\sigma^2}{2} t^2.$$

This implies that for arbitrarily small values of t, the risk premium depends monotonically 
on the degree of risk aversion, which is what we wanted to show. 
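The three equivalent conditions can be illustrated with two agents whose utility functions have constant absolute risk aversion. The sketch below assumes hypothetical coefficients 0.8 for agent A and 0.3 for agent B, a zero-mean gamble of plus or minus one unit of wealth, and a bisection solver for the risk premium; the function names are invented for this example.

```python
import math


def risk_premium(u, w, gamble, tol=1e-12):
    """Solve u(w - pi) = E u(w + eps) for pi by bisection.

    Assumes u is increasing and concave, so pi lies between 0 and the
    largest absolute outcome of the zero-mean gamble.
    """
    target = sum(p * u(w + e) for p, e in gamble)
    lo, hi = 0.0, max(abs(e) for _, e in gamble)
    while hi - lo > tol:
        mid = (lo + hi) / 2
        if u(w - mid) > target:
            lo = mid  # still better off than facing the gamble
        else:
            hi = mid
    return (lo + hi) / 2


def cara(r):
    """Utility with constant absolute risk aversion r."""
    return lambda x: -math.exp(-r * x)


gamble = [(0.5, -1.0), (0.5, 1.0)]            # zero mean, variance 1
pi_A = risk_premium(cara(0.8), 10.0, gamble)  # agent A: r = 0.8
pi_B = risk_premium(cara(0.3), 10.0, gamble)  # agent B: r = 0.3

small = [(0.5, -0.01), (0.5, 0.01)]           # zero mean, variance 1e-4
pi_small = risk_premium(cara(0.8), 10.0, small)
approx = 0.5 * 0.8 * 0.01 ** 2                # r * sigma^2 / 2 from the Taylor expansion
print(pi_A, pi_B, pi_small, approx)
```

The agent with the larger Arrow-Pratt coefficient pays the larger premium, and for the small gamble the premium is close to the second-order approximation $r\sigma^2/2$ obtained from equation (4.4).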
4.3.3 Relative Risk Aversion

Consider a consumer with wealth w and suppose that she is offered gambles of the form: 
with probability p she will receive x percent of her current wealth; with probability (1—p) 
she will receive y percent of her current wealth. If the consumer evaluates lotteries using 
expected utility, the utility of this lottery will be 

$$p u(xw) + (1 - p) u(yw).$$

Note that this multiplicative gamble has a different structure than the additive gambles
analyzed above. Nevertheless, relative gambles of this sort often arise in economic problems.
For example, the return on investments is usually stated relative to the level of
investment.

Just as before we can ask when one consumer will accept more small relative gambles 
than another at a given wealth level. Going through the same sort of analysis used above, 
we find that the appropriate measure turns out to be the Arrow-Pratt measure of relative 
risk aversion:

$$\rho = -\frac{u''(w) w}{u'(w)}.$$

It is reasonable to ask how absolute and relative risk aversions might vary with wealth. 
It is quite plausible to assume that absolute risk aversion decreases with wealth: as you 
become more wealthy you would be willing to accept more gambles expressed in absolute 
dollars. The behavior of relative risk aversion is more problematic; as your wealth increases 
would you be more or less willing to risk losing a specific fraction of it? Assuming constant 
relative risk aversion is probably not too bad an assumption, at least for small changes in 
wealth. 
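To see how the two measures behave together, one can evaluate them for a utility function with constant relative risk aversion. The sketch below assumes the standard form $u(w) = w^{1-\rho}/(1-\rho)$ with a hypothetical $\rho = 2$ (this functional form is an assumption of the example, not taken from the text) and approximates the derivatives by central finite differences.

```python
def derivatives(u, w, h=1e-3):
    """Central finite-difference approximations of u'(w) and u''(w)."""
    d1 = (u(w + h) - u(w - h)) / (2 * h)
    d2 = (u(w + h) - 2 * u(w) + u(w - h)) / (h * h)
    return d1, d2


def absolute_ra(u, w):
    """Arrow-Pratt absolute risk aversion -u''(w)/u'(w)."""
    d1, d2 = derivatives(u, w)
    return -d2 / d1


def relative_ra(u, w):
    """Arrow-Pratt relative risk aversion -u''(w) w / u'(w)."""
    return w * absolute_ra(u, w)


rho = 2.0  # hypothetical coefficient, must differ from 1 for this formula


def crra(w):
    return w ** (1 - rho) / (1 - rho)


table = [(w, absolute_ra(crra, w), relative_ra(crra, w)) for w in (1.0, 2.0, 4.0)]
for w, a, r in table:
    print(w, round(a, 4), round(r, 4))
```

For this utility function relative risk aversion stays at $\rho$ while absolute risk aversion equals $\rho / w$ and therefore falls as wealth rises, which is consistent with the plausibility argument above.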

Example 4.3.4 (Mean-variance utility) In general the expected utility of a gamble
depends on the entire probability distribution of the outcomes. However, in some circumstances
the expected utility of a gamble will only depend on certain summary statistics of
the distribution. The most common example of this is a mean-variance utility function.

For example, suppose that the expected utility function is quadratic, so that
$u(w) = w - b w^2$. Then expected utility is

$$Eu(w) = Ew - b Ew^2 = \bar{w} - b \bar{w}^2 - b \sigma_w^2.$$

Hence, the expected utility of a gamble is only a function of the mean and variance of
wealth.

Unfortunately, the quadratic utility function has some undesirable properties: it is
a decreasing function of wealth in some ranges, and it exhibits increasing absolute risk
aversion.

A more useful case when mean-variance analysis is justified is the case when wealth is
Normally distributed. It is well-known that the mean and variance completely characterize
a Normal random variable; hence, choice among Normally distributed random variables
reduces to a comparison of their means and variances. One particular case that is of
special interest is when the consumer has a utility function of the form $u(w) = -e^{-rw}$,
which exhibits constant absolute risk aversion. Furthermore, when wealth is Normally
distributed with density $f(w)$, mean $\bar{w}$ and variance $\sigma_w^2$,

$$Eu(w) = -\int e^{-rw} f(w) \, dw = -e^{-r[\bar{w} - r \sigma_w^2 / 2]}.$$

(To do the integration, either complete the square or else note that this is essentially
the calculation that one does to find the moment generating function for the Normal
distribution.) Note that expected utility is increasing in $\bar{w} - r \sigma_w^2 / 2$. This means that
we can take a monotonic transformation of expected utility and evaluate distributions
of wealth using the utility function $u(\bar{w}, \sigma_w^2) = \bar{w} - \frac{r}{2} \sigma_w^2$. This utility function has the
convenient property that it is linear in the mean and variance of wealth.
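The closed form for expected utility under Normally distributed wealth can be verified by numerical integration. The sketch below assumes hypothetical parameters (mean 5, standard deviation 2, $r = 0.3$) and a simple trapezoid rule over twelve standard deviations on each side of the mean; the names and the integration scheme are choices made for this illustration.

```python
import math


def normal_pdf(w, mean, sd):
    """Density of a Normal random variable with the given mean and sd."""
    z = (w - mean) / sd
    return math.exp(-0.5 * z * z) / (sd * math.sqrt(2 * math.pi))


def expected_cara_utility(mean, sd, r, n=20000, width=12.0):
    """Trapezoid-rule approximation of E[-exp(-r*w)] for w ~ N(mean, sd^2)."""
    a, b = mean - width * sd, mean + width * sd
    step = (b - a) / n
    total = 0.0
    for i in range(n + 1):
        w = a + i * step
        weight = 0.5 if i in (0, n) else 1.0
        total -= weight * math.exp(-r * w) * normal_pdf(w, mean, sd)
    return total * step


mean, sd, r = 5.0, 2.0, 0.3
numeric = expected_cara_utility(mean, sd, r)
closed_form = -math.exp(-r * (mean - r * sd ** 2 / 2))
print(numeric, closed_form)
```

The two numbers agree to many decimal places, so ranking Normal wealth distributions by $\bar{w} - \frac{r}{2}\sigma_w^2$ gives the same ordering as ranking them by expected utility.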

4.4 State Dependent Utility 

In our original analysis of choice under uncertainty, the prizes were simply abstract bundles 
of goods; later we specialized to lotteries with only monetary outcomes when considering 
the risk aversion issue. However, this is restrictive. After all, a complete description of 
the outcome of a dollar gamble should include not only the amount of money available in 
each outcome but also the prevailing prices in each outcome. 

More generally, the usefulness of a good often depends on the circumstances or state 
of nature in which it becomes available. An umbrella when it is raining may appear very 
different to a consumer than an umbrella when it is not raining. These examples show
that in some choice problems it is important to distinguish goods by the state of nature
in which they are available.

For example, suppose that there are two states of nature, hot and cold, which we
index by $h$ and $c$. Let $x_h$ be the amount of ice cream delivered when it is hot and $x_c$ the
amount delivered when it is cold. Then if the probability of hot weather is $p$, we may
write the expected utility of a particular lottery as $p u(h, x_h) + (1 - p) u(c, x_c)$. Here the bundle of goods that is
delivered in one state is "hot weather and $x_h$ units of ice cream," and "cold weather and
$x_c$ units of ice cream" in the other state.

A more serious example involves health insurance. The value of a dollar may well
depend on one's health: how much would a million dollars be worth to you if you were
in a coma? In this case we might well write the utility function as $u(h, m_h)$ where $h$ is
an indicator of health and $m_h$ is some amount of money. These are all examples of state-dependent
utility functions. This simply means that the preferences among the goods
under consideration depend on the state of nature under which they become available.

4.5 Subjective Probability Theory 

In the discussion of expected utility theory we have used "objective" probabilities,
such as probabilities calculated on the basis of some observed frequencies, and asked what
axioms about a person's choice behavior would imply the existence of an expected utility
function that would represent that behavior.

However, many interesting choice problems involve subjective probabilities: a given 
agent's perception of the likelihood of some event occurring. Similarly, we can ask what 
axioms about a person's choice behavior can be used to infer the existence of subjective 
probabilities; i.e., that the person's choice behavior can be viewed as if he were evaluating 
gambles according to their expected utility with respect to some subjective probability 
measures. 

As it happens, such sets of axioms exist and are reasonably plausible. Subjective 
probabilities can be constructed in a way similar to the manner with which the expected 
utility function was constructed. Recall that the utility of some gamble x was chosen to 
be that number u(x) such that 

$$x \sim u(x) \circ b \oplus (1 - u(x)) \circ w.$$

Suppose that we are trying to ascertain an individual's subjective probability that it
will rain on a certain date. Then we can ask at what probability $p$ the individual will be
indifferent between the gamble $p \circ b \oplus (1 - p) \circ w$ and the gamble "Receive $b$ if it rains
and $w$ otherwise."

More formally, let $E$ be some event, and let $p(E)$ stand for the (subjective) probability
that $E$ will occur. We define the subjective probability that $E$ occurs by the number $p(E)$
that satisfies $p(E) \circ b \oplus (1 - p(E)) \circ w \sim$ "receive $b$ if $E$ occurs and $w$ otherwise."

It can be shown that under certain regularity assumptions the probabilities defined in
this way have all of the properties of ordinary objective probabilities. In particular, they
obey the usual rules for manipulation of conditional probabilities. This has a number of
useful implications for economic behavior. We will briefly explore one such implication.
Suppose that $p(H)$ is an individual's subjective probability that a particular hypothesis
is true, and that $E$ is an event that is offered as evidence that $H$ is true. How should a
rational economic agent adjust his probability belief about $H$ in light of the evidence $E$?
That is, what is the probability of $H$ being true, conditional on observing the evidence
$E$?

We can write the joint probability of observing $E$ and $H$ being true as

$$p(H, E) = p(H|E) p(E) = p(E|H) p(H).$$

Rearranging the right-hand sides of this equation,

$$p(H|E) = \frac{p(E|H) p(H)}{p(E)}.$$

This is a form of Bayes' law which relates the prior probability $p(H)$, the probability
that the hypothesis is true before observing the evidence, to the posterior probability, the
probability that the hypothesis is true after observing the evidence.

Bayes' law follows directly from simple manipulations of conditional probabilities. If 
an individual's behavior satisfies restrictions sufficient to ensure the existence of subjective 
probabilities, those probabilities must satisfy Bayes' law. Bayes' law is important since it 
shows how a rational individual should update his probabilities in the light of evidence, 
and hence serves as the basis for most models of rational learning behavior. 
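A short computation shows how Bayes' law revises a belief. The sketch below assumes a hypothetical prior $p(H) = 0.2$ and hypothetical likelihoods $p(E|H) = 0.9$ and $p(E|\text{not } H) = 0.3$; $p(E)$ is obtained from the law of total probability, and the repeated update further assumes that successive pieces of evidence are independent given the hypothesis.

```python
def posterior(prior_h, p_e_given_h, p_e_given_not_h):
    """Bayes' law: p(H|E) = p(E|H) p(H) / p(E)."""
    # p(E) by the law of total probability over H and not-H.
    p_e = p_e_given_h * prior_h + p_e_given_not_h * (1 - prior_h)
    return p_e_given_h * prior_h / p_e


prior = 0.2
first = posterior(prior, p_e_given_h=0.9, p_e_given_not_h=0.3)

# Observing the same kind of evidence three times in a row: each posterior
# becomes the prior for the next update.
belief = prior
for _ in range(3):
    belief = posterior(belief, p_e_given_h=0.9, p_e_given_not_h=0.3)

print(first, belief)
```

Each observation multiplies the odds in favor of $H$ by the likelihood ratio $0.9 / 0.3 = 3$, so the belief rises from 0.2 to 3/7 after one observation and to 27/31 after three.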

Thus, both the utility function and the subjective probabilities can be constructed
from observed choice behavior, as long as the observed choice behavior follows certain
intuitively plausible axioms. However, it should be emphasized that although the axioms
are intuitively plausible it does not follow that they are accurate descriptions of how
individuals actually behave. That determination must be based on empirical evidence.

Expected utility theory and subjective probability theory were motivated by considerations
of rationality. The axioms underlying expected utility theory seem plausible, as
does the construction that we used for subjective probabilities.

Unfortunately, real-life individual behavior appears to systematically violate some of 
the axioms. Here we present two famous examples. 

Example 4.5.1 (The Allais paradox) You are asked to choose between the following
two gambles:

Gamble A. A 100 percent chance of receiving 1 million.

Gamble B. A 10 percent chance of 5 million, an 89 percent chance of 1 million, and
a 1 percent chance of nothing.

Before you read any further, pick one of these gambles and write it down. Now consider
the following two gambles.

Gamble C. An 11 percent chance of 1 million, and an 89 percent chance of nothing.

Gamble D. A 10 percent chance of 5 million, and a 90 percent chance of nothing.

Again, please pick one of these two gambles as your preferred choice and write it down.
Many people prefer A to B and D to C. However, these choices violate the expected utility
axioms! To see this, simply write the expected utility relationship implied by $A \succ B$:

$$u(1) > 0.1 u(5) + 0.89 u(1) + 0.01 u(0).$$

Rearranging this expression gives

$$0.11 u(1) > 0.1 u(5) + 0.01 u(0),$$

and adding $0.89 u(0)$ to each side yields

$$0.11 u(1) + 0.89 u(0) > 0.1 u(5) + 0.90 u(0).$$

It follows that gamble C must be preferred to gamble D by an expected utility maximizer.
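The algebra above says that the expected utility gap between A and B is exactly the gap between C and D, whatever the utility numbers are. The sketch below checks this with exact fractions for randomly drawn utilities of the three prizes; the random draws, the seed, and the names are assumptions made for this illustration.

```python
from fractions import Fraction as F
import random

# Each gamble maps a prize (in millions) to its probability.
GAMBLES = {
    "A": {1: F(1)},
    "B": {5: F(10, 100), 1: F(89, 100), 0: F(1, 100)},
    "C": {1: F(11, 100), 0: F(89, 100)},
    "D": {5: F(10, 100), 0: F(90, 100)},
}


def expected_utility(gamble, u):
    """Expected utility of a gamble given a dict u of prize utilities."""
    return sum(prob * u[prize] for prize, prob in gamble.items())


def gaps(u):
    """Return (EU(A) - EU(B), EU(C) - EU(D))."""
    eu = {name: expected_utility(g, u) for name, g in GAMBLES.items()}
    return eu["A"] - eu["B"], eu["C"] - eu["D"]


rng = random.Random(0)
all_equal = True
for _ in range(1000):
    # Draw increasing utilities u(0) <= u(1) <= u(5) as exact fractions.
    u0, u1, u5 = sorted(F(rng.randint(0, 10**6), 1000) for _ in range(3))
    gap_ab, gap_cd = gaps({0: u0, 1: u1, 5: u5})
    all_equal = all_equal and (gap_ab == gap_cd)

print(all_equal)
```

Since the two gaps always coincide, an expected utility maximizer who prefers A to B must prefer C to D, and the common pattern of choosing A and D cannot be rationalized by any utility function.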






Example 4.5.2 (The Ellsberg paradox) The Ellsberg paradox concerns subjective prob- 
ability theory. You are told that an urn contains 300 balls. One hundred of the balls are 
red and 200 are either blue or green. 

Gamble A. You receive$l, 000 if the ball is red. 

Gamble B. You receive $1, 000 if the ball is blue. 

Write down which of these two gambles you prefer. Now consider the following two 
gambles: 

Gamble C. You receive $1, 000 if the ball is not red. 
Gamble D. You receive $1, 000 if the ball is not blue. 

It is common for people to strictly prefer A to B and C to D. But these preferences 
violate standard subjective probability theory. To see why, let $R$ be the event that the
ball is red and $\neg R$ the event that the ball is not red, and define $B$ and $\neg B$ accordingly.
By ordinary rules of probability,

$p(R) = 1 - p(\neg R)$,
$p(B) = 1 - p(\neg B)$. (4.5)

Normalize $u(0) = 0$ for convenience. Then if A is preferred to B, we must have
$p(R)u(1000) > p(B)u(1000)$, from which it follows that

$p(R) > p(B)$. (4.6)

If C is preferred to D, we must have $p(\neg R)u(1000) > p(\neg B)u(1000)$, from which it follows
that

$p(\neg R) > p(\neg B)$. (4.7)

However, expressions (4.5), (4.6), and (4.7) are inconsistent: substituting (4.5) into (4.7)
gives $p(B) > p(R)$, which contradicts (4.6).
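
The inconsistency can be illustrated with a short search over candidate beliefs. This is a minimal sketch under stated assumptions: the function name and the grid of candidate values for p(B) are hypothetical, and p(R) is fixed at 1/3 because 100 of the 300 balls are red.

```python
from fractions import Fraction

P_RED = Fraction(1, 3)  # 100 of the 300 balls are red

def consistent_with_choices(p_blue):
    """True if a subjective p(B) rationalizes both A over B and C over D."""
    a_over_b = P_RED > p_blue                # expression (4.6)
    c_over_d = (1 - P_RED) > (1 - p_blue)    # expression (4.7), using (4.5)
    return a_over_b and c_over_d

# Candidate beliefs: anywhere from 0 to 200 of the 300 balls may be blue.
candidates = [Fraction(k, 300) for k in range(0, 201)]
rationalizing = [p for p in candidates if consistent_with_choices(p)]
print(rationalizing)  # the list is empty: no belief supports both choices
```

Whatever probability is assigned to blue, at most one of the two strict preferences can hold.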

The Ellsberg paradox seems to be due to the fact that people think that betting for
or against "red" is "safer" than betting for or against "blue."

Opinions differ about the importance of the Allais paradox and the Ellsberg paradox.
Some economists think that these anomalies require new models to describe people's
behavior. Others think that these paradoxes are akin to "optical illusions": the fact that
people are poor at judging distances under some circumstances doesn't mean that we
need to invent a new concept of distance.
Part II

Strategic Behavior and Markets 
We have restricted ourselves until now to the ideal situation (the benchmark case) in which
the behavior of others is summarized by non-individualized parameters, the prices of
commodities: each individual makes decisions independently, taking prices as given, and
individuals interact only indirectly through prices. This is clearly a very restrictive
assumption. In many cases, one individual's action directly affects the actions of others
and is in turn affected by theirs. It is therefore important to study the more realistic case
in which individuals' behavior affects one another and they interact directly. Game theory
is a powerful tool for studying individuals' cooperation and the resolution of possible
conflicts.

In this part, we will first discuss game theory and then the various types of market
structures in which one individual's decision may affect the decisions of others. It is
gratefully acknowledged that the material is largely drawn from the references at the end
of each chapter, especially Varian (1992).
Chapter 5
Game Theory 



5.1 Introduction 

Game theory is the study of interacting decision makers. In earlier chapters we studied
the theory of optimal decision making by a single agent (a firm or a consumer) in very simple
environments. The strategic interactions of the agents were not very complicated. In this
chapter we will lay the foundations for a deeper analysis of the behavior of economic 
agents in more complex environments. 

There are many directions from which one could study interacting decision makers. 
One could examine behavior from the viewpoint of sociology, psychology, biology, etc. 
Each of these approaches is useful in certain contexts. Game theory emphasizes a study 
of cold-blooded "rational" decision making, since this is felt to be the most appropriate 
model for most economic behavior. 

Game theory has been widely used in economics in the last two decades, and much
progress has been made in clarifying the nature of strategic interaction in economic mod- 
els. Indeed, most economic behavior can be viewed as special cases of game theory, and 
a sound understanding of game theory is a necessary component of any economist's set 
of analytical tools. 






5.2 Description of a game 

There are several ways of describing a game. For our purposes, the strategic form and 
the extensive form will be sufficient. Roughly speaking the extensive form provides an 
"extended" description of a game while the strategic form provides a "reduced" summary 
of a game. We will first describe the strategic form, reserving the discussion of the 
extensive form for the section on sequential games. 

5.2.1 Strategic Form 

The strategic form of the game is defined by exhibiting a set of players $N = \{1, 2, \ldots, n\}$.
Each player $i$ has a set of strategies $S_i$ from which he/she can choose an action $s_i \in S_i$ and
a payoff function, $\phi_i(s)$, that indicates the utility that player $i$ receives if a particular
combination $s$ of strategies is chosen, where $s = (s_1, s_2, \ldots, s_n) \in S = \prod_{i=1}^{n} S_i$. For
purposes of exposition, we will treat two-person games in this chapter. All of the concepts
described below can be easily extended to multi-person contexts.

We assume that the description of the game (the payoffs and the strategies available
to the players) is common knowledge. That is, each player knows his own payoffs
and strategies, and the other player's payoffs and strategies. Furthermore, each player 
knows that the other player knows this, and so on. We also assume that it is common 
knowledge that each player is "fully rational." That is, each player can choose an action 
that maximizes his utility given his subjective beliefs, and that those beliefs are modified 
when new information arrives according to Bayes' law. 

Game theory, by this account, is a generalization of standard, one-person decision the- 
ory. How should a rational expected utility maximizer behave in a situation in which his 
payoff depends on the choices of another rational expected utility maximizer? Obviously, 
each player will have to consider the problem faced by the other player in order to make 
a sensible choice. We examine the outcome of this sort of consideration below. 

Example 5.2.1 (Matching pennies) In this game, there are two players, Row and 
Column. Each player has a coin which he can arrange so that either the head side or the 
tail side is face-up. Thus, each player has two strategies which we abbreviate as Heads 
or Tails. Once the strategies are chosen there are payoffs to each player which depend on 
the choices that both players make.

These choices are made independently, and neither player knows the other's choice
when he makes his own choice. We suppose that if both players show heads or both show
tails, then Row wins a dollar and Column loses a dollar. If, on the other hand, one player
exhibits heads and the other exhibits tails, then Column wins a dollar and Row loses a
dollar.

                    Column
                 Heads      Tails
Row   Heads     (1,-1)     (-1,1)
      Tails     (-1,1)     (1,-1)

Table 5.1: Game Matrix of Matching Pennies

We can depict the strategic interactions in a game matrix. The entry in box (Heads,
Tails) indicates that player Row gets -1 and player Column gets +1 if this particular
combination of strategies is chosen. Note that in each entry of the matrix, the payoff to
player Row is just the negative of the payoff to player Column. In other words, this is a
zero-sum game. In zero-sum games the interests of the players are diametrically opposed,
and such games are particularly simple to analyze. However, most games of interest to
economists are not zero-sum games.
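
As an illustration, a game matrix can be stored as a mapping from strategy pairs to payoff pairs, and the zero-sum property checked cell by cell. This is a minimal sketch; the dictionary layout and the function name are assumptions chosen for the example.

```python
# Payoffs (Row, Column) from Table 5.1, indexed by (Row's choice, Column's choice).
MATCHING_PENNIES = {
    ("Heads", "Heads"): (1, -1),
    ("Heads", "Tails"): (-1, 1),
    ("Tails", "Heads"): (-1, 1),
    ("Tails", "Tails"): (1, -1),
}

def is_zero_sum(game):
    """A two-person game is zero-sum if the payoffs in every cell add to zero."""
    return all(row + col == 0 for row, col in game.values())

print(is_zero_sum(MATCHING_PENNIES))
```

The same representation works for any two-person game in strategic form, whether or not it is zero-sum.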

Example 5.2.2 (The Prisoner's Dilemma) Again we have two players, Row and Column,
but now their interests are only partially in conflict. There are two strategies: to
Cooperate or to Defect.

In the original story, Row and Column were two prisoners who jointly participated
in a crime. They could cooperate with each other and refuse to give evidence (i.e., do
not confess), or one could defect (i.e., confess) and implicate the other. They are held in
separate cells, and each is privately told that if he is the only one to confess, then he will
be rewarded with a light sentence of 1 year while the recalcitrant prisoner will go to jail
for 10 years. However, if he is the only one not to confess, then it is he who will
serve the 10-year sentence. If both confess, they will both be shown some mercy: they
will each get 5 years. Finally, if neither confesses, it will still be possible to convict both of
a lesser crime that carries a sentence of 2 years. Each player wishes to minimize the time
he spends in jail. The outcome can be shown in Table 5.2.

                                     Prisoner 2
                             Don't Confess    Confess
Prisoner 1   Don't Confess      (-2,-2)       (-10,-1)
             Confess            (-1,-10)      (-5,-5)

Table 5.2: The Prisoner's Dilemma



The problem is that each party has an incentive to confess, regardless of what he or
she believes the other party will do. In this prisoner's dilemma, "confess" is the best
strategy for each prisoner regardless of the choice of the other.

An especially simple revised version of the above prisoner's dilemma given by Aumann 
(1987) is the game in which each player can simply announce to a referee: "Give me 
$1,000," or "Give the other player $3,000." Note that the monetary payments come 
from a third party, not from either of the players; the Prisoner's Dilemma is a variable- 
sum game. 

The players can discuss the game in advance but the actual decisions must be independent.
The Cooperate strategy is for each person to announce the $3,000 gift, while the
Defect strategy is to take the $1,000 (and run!). Table 5.3 depicts the payoff matrix to the
Aumann version of the Prisoner's Dilemma, where the units of the payoff are thousands
of dollars.

                         Column
                   Cooperate    Defect
Row   Cooperate      (3,3)      (0,4)
      Defect         (4,0)      (1,1)

Table 5.3: A Revised Version of Prisoner's Dilemma by Aumann

We will discuss this game in more detail below. Again, each party has an incentive to
defect, regardless of what he or she believes the other party will do. For if I believe that
the other person will cooperate and give me a $3,000 gift, then I will get $4,000 in total
by defecting. On the other hand, if I believe that the other person will defect and just
take the $1,000, then I do better by taking $1,000 for myself.

In other applications, cooperate and defect could have different meanings. For exam- 
ple, in a duopoly situation, cooperate could mean "keep charging a high price" and defect 
could mean "cut your price and steal your competitor's market." 

Example 5.2.3 (Cournot Duopoly) Consider a simple duopoly game, first analyzed 
by Cournot (1838). We suppose that there are two firms who produce an identical good 
with a marginal cost c. Each firm must decide how much output to produce without 
knowing the production decision of the other duopolist. If the firms produce a total of x 
units of the good, the market price will be p(x); that is, p(x) is the inverse demand curve 
facing these two producers. 

If $x_i$ is the production level of firm $i$, the market price will then be $p(x_1 + x_2)$, and the
profits of firm $i$ are given by $\pi_i = (p(x_1 + x_2) - c)x_i$. In this game the strategy of firm $i$ is
its choice of production level and the payoff to firm $i$ is its profits.
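
The payoff function of the Cournot game can be sketched in a few lines of Python. The linear inverse demand p(x) = a - bx and the parameter values a = 10, b = 1, c = 1 are assumptions made only for this illustration; the text leaves p(x) general.

```python
def inverse_demand(total_output, a=10.0, b=1.0):
    """Assumed linear inverse demand p(x) = a - b*x, floored at zero."""
    return max(a - b * total_output, 0.0)

def cournot_profit(own_output, rival_output, c=1.0):
    """Profit of a firm: (p(x_1 + x_2) - c) * x_i."""
    price = inverse_demand(own_output + rival_output)
    return (price - c) * own_output

# Payoffs to firm 1 and firm 2 for one strategy combination (x_1, x_2).
x1, x2 = 3.0, 2.0
print(cournot_profit(x1, x2), cournot_profit(x2, x1))
```

Each firm's payoff depends on both output choices through the common price, which is what makes this a game rather than two separate decision problems.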

Example 5.2.4 (Bertrand duopoly) Consider the same setup as in the Cournot game, 
but now suppose that the strategy of each player is to announce the price at which he 
would be willing to supply an arbitrary amount of the good in question. In this case 
the payoff function takes a radically different form. It is plausible to suppose that the 
consumers will only purchase from the firm with the lowest price, and that they will split 
evenly between the two firms if they charge the same price. Letting $x(p)$ represent the
market demand function and $c$ the marginal cost, this leads to a payoff to firm 1 of the
form:

$$\pi_1(p_1, p_2) = \begin{cases} (p_1 - c)\,x(p_1) & \text{if } p_1 < p_2 \\ (p_1 - c)\,x(p_1)/2 & \text{if } p_1 = p_2 \\ 0 & \text{if } p_1 > p_2. \end{cases}$$

This game has a similar structure to that of the Prisoner's Dilemma. If both players
cooperate, they can charge the monopoly price and each reap half of the monopoly profits.
But the temptation is always there for one player to cut its price slightly and thereby
capture the entire market for itself. But if both players cut price, then they are both
worse off.

Note that the Cournot game and the Bertrand game have a radically different structure,
even though they purport to model the same economic phenomenon, a duopoly.
In the Cournot game, the payoff to each firm is a continuous function of its strategic
choice; in the Bertrand game, the payoffs are discontinuous functions of the strategies.
As might be expected, this leads to quite different equilibria. Which of these models is
reasonable? The answer is that it depends on what you are trying to model. In most
economic modelling, there is an art to choosing a representation of the strategy choices
of the game that captures an element of the real strategic interactions, while at the same
time leaving the game simple enough to analyze.
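
The discontinuity of the Bertrand payoff is easy to see numerically. The sketch below assumes a linear demand x(p) = 10 - p and a marginal cost c = 1; both are hypothetical choices for illustration only. Firm 1 takes the whole market if it has the lower price, half of it at equal prices, and nothing otherwise.

```python
def demand(price, a=10.0):
    """Assumed linear market demand x(p) = a - p, floored at zero."""
    return max(a - price, 0.0)

def bertrand_profit(p1, p2, c=1.0):
    """Payoff to firm 1 when it posts price p1 and its rival posts p2."""
    if p1 < p2:
        return (p1 - c) * demand(p1)        # firm 1 serves the whole market
    if p1 == p2:
        return (p1 - c) * demand(p1) / 2    # the market is split evenly
    return 0.0                              # firm 1 sells nothing

rival_price = 5.0
for own_price in (4.99, 5.0, 5.01):
    print(own_price, bertrand_profit(own_price, rival_price))
```

A one-cent change in firm 1's price around the rival's price moves its profit between roughly the full market profit, half of it, and zero, whereas a small change in output in the Cournot game changes profit only slightly.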

5.3 Solution Concepts 

5.3.1 Mixed Strategies and Pure Strategies 

In many games the nature of the strategic interaction suggests that a player wants to 
choose a strategy that is not predictable in advance by the other player. Consider, for 
example, the Matching Pennies game described above. Here it is clear that neither player 
wants the other player to be able to predict his choice accurately. Thus, it is natural to 
consider a random strategy of playing heads with some probability $p_h$ and tails with some
probability $p_t$. Such a strategy is called a mixed strategy. Strategies in which some
choice is made with probability 1 are called pure strategies.

If R is the set of pure strategies available to Row, the set of mixed strategies open 
to Row will be the set of all probability distributions over R, where the probability of 
playing strategy $r$ in $R$ is $p_r$. Similarly, $p_c$ will be the probability that Column plays some
strategy $c$. In order to solve the game, we want to find a set of mixed strategies $(p_r, p_c)$
that are, in some sense, in equilibrium. It may be that some of the equilibrium mixed 
strategies assign probability 1 to some choices, in which case they are interpreted as pure 
strategies. 

The natural starting point in a search for a solution concept is standard decision 
theory: we assume that each player has some probability beliefs about the strategies that 
the other player might choose and that each player chooses the strategy that maximizes 
his expected payoff. 
Suppose for example that the payoff to Row is $u_r(r, c)$ if Row plays $r$ and Column plays
$c$. We assume that Row has a subjective probability distribution over Column's choices
which we denote by $(\pi_c)$; see Chapter 4 for the fundamentals of the idea of subjective
probability. Here $\pi_c$ is supposed to indicate the probability, as envisioned by Row, that
Column will make the choice $c$. Similarly, Column has some beliefs about Row's behavior
that we can denote by $(\pi_r)$.

We allow each player to play a mixed strategy and denote Row's actual mixed strategy
by $(p_r)$ and Column's actual mixed strategy by $(p_c)$. Since Row makes his choice without
knowing Column's choice, Row's probability that a particular outcome $(r, c)$ will occur is
$p_r \pi_c$. This is simply the (objective) probability that Row plays $r$ times Row's (subjective)
probability that Column plays $c$. Hence, Row's objective is to choose a probability
distribution $(p_r)$ that maximizes

Row's expected payoff $= \sum_r \sum_c p_r \pi_c u_r(r, c)$.

Column, on the other hand, wishes to maximize

Column's expected payoff $= \sum_c \sum_r p_c \pi_r u_c(r, c)$.

So far we have simply applied a standard decision-theoretic model to this game - each 
player wants to maximize his or her expected utility given his or her beliefs. Given my 
beliefs about what the other player might do, I choose the strategy that maximizes my 
expected utility. 
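
To make the double sum concrete, here is a minimal Python sketch of Row's expected payoff in Matching Pennies. The belief that Column plays Heads with probability 0.75 is a hypothetical number chosen for illustration, and the function name is likewise an assumption.

```python
# Row's payoffs u_r(r, c) in Matching Pennies (Table 5.1).
ROW_PAYOFF = {
    ("Heads", "Heads"): 1, ("Heads", "Tails"): -1,
    ("Tails", "Heads"): -1, ("Tails", "Tails"): 1,
}

def row_expected_payoff(p_row, belief_about_column, payoff=ROW_PAYOFF):
    """Sum over r and c of p_r * pi_c * u_r(r, c)."""
    return sum(
        p_row[r] * belief_about_column[c] * payoff[(r, c)]
        for r in p_row
        for c in belief_about_column
    )

# Hypothetical belief: Row thinks Column plays Heads with probability 0.75.
belief = {"Heads": 0.75, "Tails": 0.25}
for p_heads in (0.0, 0.5, 1.0):
    strategy = {"Heads": p_heads, "Tails": 1.0 - p_heads}
    print(p_heads, row_expected_payoff(strategy, belief))
```

With this belief, Row's expected payoff rises with the probability he puts on Heads, so his best reply is the pure strategy Heads. Only when Row believes that Column mixes half and half is he indifferent among all of his strategies.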

5.3.2 Nash equilibrium 

In the expected payoff formulas given at the end of the last subsection, Row's behavior
(how likely he is to play each of his strategies) is represented by the probability
distribution $(p_r)$, and Column's beliefs about Row's behavior are represented by the
(subjective) probability distribution $(\pi_r)$.

A natural consistency requirement is that each player's belief about the other player's 
choices coincides with the actual choices the other player intends to make. Expectations 
that are consistent with actual frequencies are sometimes called rational expectations. A 
Nash equilibrium is a certain kind of rational expectations equilibrium. More formally: 

Definition 5.3.1 (Nash Equilibrium in Mixed Strategies.) A Nash equilibrium in
mixed strategies consists of probability beliefs $(\pi_r, \pi_c)$ over strategies, and probabilities of
choosing strategies $(p_r, p_c)$, such that:

1. the beliefs are correct: $p_r = \pi_r$ and $p_c = \pi_c$ for all $r$ and $c$; and,

2. each player is choosing (p r ) and (p c ) so as to maximize his expected utility given 
his beliefs. 

In this definition a Nash equilibrium in mixed strategies is an equilibrium in actions and 
beliefs. In equilibrium each player correctly foresees how likely the other player is to make 
various choices, and the beliefs of the two players are mutually consistent. 

A more conventional definition of a Nash equilibrium in mixed strategies is that it is 
a pair of mixed strategies $(p_r, p_c)$ such that each agent's choice maximizes his expected
utility, given the strategy of the other agent. This is equivalent to the definition we 
use, but it is misleading since the distinction between the beliefs of the agents and the 
actions of the agents is blurred. We've tried to be very careful in distinguishing these two 
concepts. 

One particularly interesting special case of a Nash equilibrium in mixed strategies 
is a Nash equilibrium in pure strategies, which is simply a Nash equilibrium in mixed 
strategies in which the probability of playing a particular strategy is 1 for each player. 
That is: 

Definition 5.3.2 (Nash equilibrium in Pure Strategies.) A Nash equilibrium in pure
strategies is a pair $(r^*, c^*)$ such that $u_r(r^*, c^*) \ge u_r(r, c^*)$ for all Row strategies $r$, and
$u_c(r^*, c^*) \ge u_c(r^*, c)$ for all Column strategies $c$.

A Nash equilibrium is a minimal consistency requirement to put on a pair of strategies: 
if Row believes that Column will play c*, then Row's best reply is r* and similarly for 
Column. No player would find it in his or her interest to deviate unilaterally from a Nash 
equilibrium strategy. 
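
The definition of a pure strategy Nash equilibrium translates directly into a search over all strategy pairs. The sketch below is one straightforward way to do this for small two-person games; the function name and the dictionary representation are assumptions made for the example.

```python
def pure_nash_equilibria(game):
    """Return all (r*, c*) satisfying the two inequalities in the definition.

    `game` maps (row strategy, column strategy) to (u_r, u_c).
    """
    rows = sorted({r for r, _ in game})
    cols = sorted({c for _, c in game})
    equilibria = []
    for r_star in rows:
        for c_star in cols:
            u_r, u_c = game[(r_star, c_star)]
            # Row cannot gain by deviating while Column keeps c*.
            row_ok = all(u_r >= game[(r, c_star)][0] for r in rows)
            # Column cannot gain by deviating while Row keeps r*.
            col_ok = all(u_c >= game[(r_star, c)][1] for c in cols)
            if row_ok and col_ok:
                equilibria.append((r_star, c_star))
    return equilibria

MATCHING_PENNIES = {
    ("Heads", "Heads"): (1, -1), ("Heads", "Tails"): (-1, 1),
    ("Tails", "Heads"): (-1, 1), ("Tails", "Tails"): (1, -1),
}
AUMANN_PD = {  # Table 5.3, payoffs in thousands of dollars
    ("Cooperate", "Cooperate"): (3, 3), ("Cooperate", "Defect"): (0, 4),
    ("Defect", "Cooperate"): (4, 0), ("Defect", "Defect"): (1, 1),
}

print(pure_nash_equilibria(MATCHING_PENNIES))
print(pure_nash_equilibria(AUMANN_PD))
```

Matching Pennies has no pure strategy equilibrium, while the Aumann version of the Prisoner's Dilemma has exactly one, (Defect, Defect).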

If a set of strategies is not a Nash equilibrium then at least one player is not consistently 
thinking through the behavior of the other player. That is, one of the players must expect 
the other player not to act in his own self-interest - contradicting the original hypothesis 
of the analysis. 
An equilibrium concept is often thought of as a "rest point" of some adjustment
process. One interpretation of Nash equilibrium is that it is the rest point of the adjustment
process of "thinking through" the incentives of the other player. Row might think: "If I think that
Column is going to play some strategy $c_1$ then the best response for me is to play $r_1$. But
if Column thinks that I will play $r_1$, then the best thing for him to do is to play some
other strategy $c_2$. But if Column is going to play $c_2$, then my best response is to play
$r_2$ ..." and so on.

Example 5.3.1 (Nash equilibrium of Battle of the Sexes) The following game is 
known as the "Battle of the Sexes." The story behind the game goes something like 
this. Rhonda Row and Calvin Column are discussing whether to take microeconomics 
or macroeconomics this semester. Rhonda gets utility 2 and Calvin gets utility 1 if they 
both take micro; the payoffs are reversed if they both take macro. If they take different 
courses, they both get utility 0. 

Let us calculate all the Nash equilibria of this game. First, we look for the Nash 
equilibria in pure strategies. This simply involves a systematic examination of the best 
responses to various strategy choices. Suppose that Column thinks that Row will play 
Top. Column gets 1 from playing Left and 0 from playing Right, so Left is Column's best
response to Row playing Top. On the other hand, if Column plays Left, then it is easy 
to see that it is optimal for Row to play Top. This line of reasoning shows that (Top, 
Left) is a Nash equilibrium. A similar argument shows that (Bottom, Right) is a Nash 
equilibrium. 

                                     Calvin
                            Left (micro)    Right (macro)
Rhonda   Top (micro)           (2,1)           (0,0)
         Bottom (macro)        (0,0)           (1,2)

Table 5.4: Battle of the Sexes

We can also solve this game systematically by writing the maximization problem
that each agent has to solve and examining the first-order conditions. Let $(p_t, p_b)$ be
the probabilities with which Row plays Top and Bottom, and define $(p_l, p_r)$ in a similar
manner. Then Row's problem is

$$\max_{(p_t, p_b)} \; p_t[2p_l + 0p_r] + p_b[0p_l + 1p_r]$$

such that

$$p_t + p_b = 1, \qquad p_t \ge 0, \qquad p_b \ge 0.$$

Let $\lambda$, $\mu_t$, and $\mu_b$ be the Kuhn-Tucker multipliers on the constraints, so that the
Lagrangian takes the form:

$$\mathcal{L} = 2p_t p_l + p_b p_r - \lambda(p_t + p_b - 1) - \mu_t p_t - \mu_b p_b.$$

Differentiating with respect to $p_t$ and $p_b$, we see that the Kuhn-Tucker conditions for Row
are

$$2p_l = \lambda + \mu_t, \qquad p_r = \lambda + \mu_b. \qquad (5.1)$$

Since we already know the pure strategy solutions, we only consider the case where $p_t > 0$
and $p_b > 0$. The complementary slackness conditions then imply that $\mu_t = \mu_b = 0$, so
(5.1) gives $2p_l = p_r$. Using the fact that $p_l + p_r = 1$, we easily see that Row will find it
optimal to play a mixed strategy when $p_l = 1/3$ and $p_r = 2/3$.

Following the same procedure for Column, we find that $p_t = 2/3$ and $p_b = 1/3$.
The expected payoff to each player from this mixed strategy can be easily computed by
plugging these numbers into the objective function. In this case the expected payoff is 2/3
to each player. Note that each player would prefer either of the pure strategy equilibria
to the mixed strategy since the payoffs are higher for each player.
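
The mixed strategy equilibrium can be verified with exact arithmetic. The sketch below takes the probabilities derived above (Row plays Top with probability 2/3, Column plays Left with probability 1/3) and checks that each player is indifferent between his or her two pure strategies, which is what the first-order conditions require when both multipliers on the nonnegativity constraints are zero. The helper names are assumptions for this illustration.

```python
from fractions import Fraction

# Payoffs (Rhonda, Calvin) from Table 5.4.
PAYOFFS = {
    ("Top", "Left"): (2, 1), ("Top", "Right"): (0, 0),
    ("Bottom", "Left"): (0, 0), ("Bottom", "Right"): (1, 2),
}

p_top, p_left = Fraction(2, 3), Fraction(1, 3)   # candidate mixed equilibrium
p_row = {"Top": p_top, "Bottom": 1 - p_top}
p_col = {"Left": p_left, "Right": 1 - p_left}

def row_payoff_of(pure_row):
    """Rhonda's expected payoff from a pure strategy against Calvin's mix."""
    return sum(p_col[c] * PAYOFFS[(pure_row, c)][0] for c in p_col)

def col_payoff_of(pure_col):
    """Calvin's expected payoff from a pure strategy against Rhonda's mix."""
    return sum(p_row[r] * PAYOFFS[(r, pure_col)][1] for r in p_row)

# Each player is indifferent between the two pure strategies,
# so no deviation from the mixed strategy can raise the expected payoff.
print(row_payoff_of("Top"), row_payoff_of("Bottom"))
print(col_payoff_of("Left"), col_payoff_of("Right"))
```

All four values equal 2/3, which is also the equilibrium expected payoff of each player and is lower than what either player receives in either pure strategy equilibrium.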

Remark 5.3.1 One disadvantage of the notion of a mixed strategy is that it is sometimes
difficult to give it a behavioral interpretation, although mixed strategies are the only
sensible equilibrium for some games such as Matching Pennies. In a duopoly game, for
example, mixed strategies seem unrealistic.

5.3.3 Dominant strategies

Let $r_1$ and $r_2$ be two of Row's strategies. We say that $r_1$ strictly dominates $r_2$ for Row if
the payoff from strategy $r_1$ is strictly larger than the payoff from $r_2$ no matter what choice
Column makes. The strategy $r_1$ weakly dominates $r_2$ if the payoff from $r_1$ is at least as
large for all choices Column might make and strictly larger for some choice.

A dominant strategy equilibrium is a choice of strategies by each player such 
that each strategy (weakly) dominates every other strategy available to that player. One 
particularly interesting game that has a dominant strategy equilibrium is the Prisoner's 
Dilemma in which the dominant strategy equilibrium is (confess, confess). If I believe that 
the other agent will not confess, then it is to my advantage to confess; and if I believe 
that the other agent will confess, it is still to my advantage to confess. 

Clearly, a dominant strategy equilibrium is a Nash equilibrium, but not all Nash 
equilibria are dominant strategy equilibria. A dominant strategy equilibrium, should one 
exist, is an especially compelling solution to the game, since there is a unique optimal 
choice for each player. 
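
Strict and weak dominance can be checked mechanically. The following sketch is an illustration under stated assumptions: the function `dominates` and its argument conventions (player 0 is Row, player 1 is Column) are hypothetical, and the payoffs are those of Table 5.2, measured as minus the years spent in jail.

```python
def dominates(game, player, s1, s2, strict=True):
    """True if strategy s1 dominates s2 for `player` (0 = Row, 1 = Column)."""
    # Strategies available to the opponent.
    others = sorted({key[1 - player] for key in game})

    def payoff(own, other):
        key = (own, other) if player == 0 else (other, own)
        return game[key][player]

    diffs = [payoff(s1, o) - payoff(s2, o) for o in others]
    if strict:
        return all(d > 0 for d in diffs)
    # Weak dominance: never worse, and strictly better at least once.
    return all(d >= 0 for d in diffs) and any(d > 0 for d in diffs)

# Table 5.2: payoffs are minus the years spent in jail.
PRISONERS = {
    ("Don't Confess", "Don't Confess"): (-2, -2),
    ("Don't Confess", "Confess"): (-10, -1),
    ("Confess", "Don't Confess"): (-1, -10),
    ("Confess", "Confess"): (-5, -5),
}

for player in (0, 1):
    print(player, dominates(PRISONERS, player, "Confess", "Don't Confess"))
```

For both prisoners Confess strictly dominates Don't Confess, so (Confess, Confess) is a dominant strategy equilibrium.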

5.4 Repeated games 

In many cases, it is not appropriate to expect the outcome of a repeated game with
the same players to be simply a repetition of the one-shot game. This is because the
strategy space of the repeated game is much larger: each player can determine his or her 
choice at some point as a function of the entire history of the game up until that point. 
Since my opponent can modify his behavior based on my history of choices, I must take 
this influence into account when making my own choices. 

Let us analyze this in the context of the simple Prisoner's Dilemma game described 
earlier. Here it is in the "long-run" interest of both players to try to get to the (Cooperate, 
Cooperate) solution. So it might be sensible for one player to try to "signal" to the other 
that he is willing to "be nice" and play cooperate on the first move of the game. It is 
in the short-run interest of the other player to Defect, of course, but is this really in his 
long-run interest? He might reason that if he defects, the other player may lose patience 
and simply play Defect himself from then on. Thus, the second player might lose in the 
long run from playing the short-run optimal strategy. What lies behind this reasoning
is the fact that a move that I make now may have repercussions in the future: the other
player's future choices may depend on my current choices.

Let us ask whether the strategy of (Cooperate, Cooperate) can be a Nash equilibrium 
of the repeated Prisoner's Dilemma. First we consider the case where each player
knows that the game will be repeated a fixed number of times. Consider the reasoning 
of the players just before the last round of play. Each reasons that, at this point, they 
are playing a one-shot game. Since there is no future left on the last move, the standard 
logic for Nash equilibrium applies and both parties Defect. 

Now consider the move before the last. Here it seems that it might pay each of the 
players to cooperate in order to signal that they are "nice guys" who will cooperate again 
in the next and final move. But we've just seen that when the next move comes around, 
each player will want to play Defect. Hence there is no advantage to cooperating on the 
next to the last move — as long as both players believe that the other player will Defect 
on the final move, there is no advantage to try to influence future behavior by being nice 
on the penultimate move. The same logic of backwards induction works for two moves 
before the end, and so on. In a repeated Prisoner's Dilemma with a finite number 
of repetitions, the Nash equilibrium is still to Defect in every round. 

The situation is quite different in a repeated game with an infinite number of repeti- 
tions. In this case, at each stage it is known that the game will be repeated at least one 
more time and therefore there will be some (potential) benefits to cooperation. Let's see 
how this works in the case of the Prisoner's Dilemma. 

Consider a game that consists of an infinite number of repetitions of the Prisoner's 
Dilemma described earlier. The strategies in this repeated game are sequences of functions 
that indicate whether each player will Cooperate or Defect at a particular stage as a 
function of the history of the game up to that stage. The payoffs in the repeated game 
are the discounted sums of the payoffs at each stage; that is, if a player gets a payoff at
time $t$ of $u_t$, his payoff in the repeated game is taken to be $\sum_{t=0}^{\infty} u_t/(1 + r)^t$, where $r$ is
the discount rate.

Now we show that as long as the discount rate is not too high there exists a 
Nash equilibrium pair of strategies such that each player finds it in his interest 
to cooperate at each stage. In fact, it is easy to exhibit an explicit example of such
strategies. Consider the following strategy: "Cooperate on the current move unless the 
other player defected on the last move. If the other player defected on the last move, then 
Defect forever." This is sometimes called a punishment strategy, for obvious reasons: if a 
player defects, he will be punished forever with a low payoff. 

To show that a pair of punishment strategies constitutes a Nash equilibrium, we simply 
have to show that if one player plays the punishment strategy the other player can do no 
better than playing the punishment strategy. Suppose that the players have cooperated up 
until move T and consider what would happen if a player decided to Defect on this move. 
Using the numbers from the Prisoner's Dilemma example, he would get an immediate 
payoff of 4, but he would also doom himself to an infinite stream of payments of 1. The 
discounted value of such a stream of payments is 1/r, so his total expected payoff from 
Defecting is 4 + 1/r. 

On the other hand, his expected payoff from continuing to cooperate is 3 + 3/r. 
Continuing to cooperate is preferred as long as 3 + 3/r > 4 + 1/r, which reduces to
requiring that r < 2. As long as this condition is satisfied, the punishment strategy forms 
a Nash equilibrium: if one party plays the punishment strategy, the other party will also 
want to play it, and neither party can gain by unilaterally deviating from this choice. 
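
The comparison between cooperating and defecting can be tabulated for a few discount rates. This is a minimal sketch using the stage payoffs 3, 4, and 1 from the Aumann version of the Prisoner's Dilemma; the function names and the sample values of r are assumptions made for the illustration.

```python
def cooperate_value(r, stage_payoff=3.0):
    """Payoff from cooperating forever: 3 now plus 3 in every later period."""
    return stage_payoff + stage_payoff / r

def defect_value(r, temptation=4.0, punishment=1.0):
    """Payoff from defecting now (4) and being punished forever (1 per period)."""
    return temptation + punishment / r

def cooperation_sustainable(r):
    """True if continuing to cooperate is strictly better than defecting."""
    return cooperate_value(r) > defect_value(r)

for r in (0.05, 1.0, 1.9, 2.0, 3.0):
    print(r, cooperate_value(r), defect_value(r), cooperation_sustainable(r))
```

The comparison switches exactly at r = 2: below that value the one-time gain of 1 from defecting is outweighed by the discounted loss of 2 per period in all later periods.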

This construction is quite robust. Essentially the same argument works for any payoffs 
that exceed the payoffs from (Defect, Defect). A famous result known as the Folk Theorem 
asserts precisely this: in a repeated Prisoner's Dilemma any payoff larger than the payoff 
received if both parties consistently defect can be supported as a Nash equilibrium. 

Example 5.4.1 (Maintaining a Cartel) Consider a simple repeated duopoly which
yields profits $(\pi_c, \pi_c)$ if both firms choose to play a Cournot game and $(\pi_j, \pi_j)$ if both
firms produce the level of output that maximizes their joint profits - that is, they act 
as a cartel. It is well-known that the levels of output that maximize joint profits are 
typically not Nash equilibria in a single-period game — each producer has an incentive to 
dump extra output if he believes that the other producer will keep his output constant. 
However, as long as the discount rate is not too high, the joint profit-maximizing solution 
will be a Nash equilibrium of the repeated game. The appropriate punishment strategy is 
for each firm to produce the cartel output unless the other firm deviates, in which case it 
will produce the Cournot output forever. An argument similar to the Prisoner's Dilemma
argument shows that this is a Nash equilibrium. 
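
The argument can be sketched as follows. The symbol $\pi_d$, the one-period profit of a firm that deviates optimally while its rival produces the cartel output, does not appear in the text and is introduced here only for illustration; it is assumed that $\pi_d > \pi_j > \pi_c$ and that payoffs are discounted as in the Prisoner's Dilemma argument.

```latex
\begin{align*}
\text{produce the cartel output forever:}\quad & \pi_j + \frac{\pi_j}{r}, \\
\text{deviate once, then Cournot forever:}\quad & \pi_d + \frac{\pi_c}{r}, \\
\pi_j + \frac{\pi_j}{r} \ge \pi_d + \frac{\pi_c}{r}
\quad\Longleftrightarrow\quad & r \le \frac{\pi_j - \pi_c}{\pi_d - \pi_j}.
\end{align*}
```

With the Prisoner's Dilemma numbers ($\pi_j = 3$, $\pi_c = 1$, $\pi_d = 4$) the bound equals 2, in line with the condition on r derived earlier.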

5.5 Refinements of Nash equilibrium 

The Nash equilibrium concept seems like a reasonable definition of an equilibrium of a 
game. As with any equilibrium concept, there are two questions of immediate interest: 
1) will a Nash equilibrium generally exist; and 2) will the Nash equilibrium be unique? 

Existence, luckily, is not a problem. Nash (1950) showed that with a finite number of 
agents and a finite number of pure strategies, an equilibrium will always exist. It may, 
of course, be an equilibrium involving mixed strategies. We will show in Chapter 7 that
a pure strategy Nash equilibrium always exists if the strategy space is a compact and
convex set and payoff functions are continuous and quasi-concave.

Uniqueness, however, is very unlikely to occur in general. We have already seen that 
there may be several Nash equilibria to a game. Game theorists have invested a substantial 
amount of effort into discovering further criteria that can be used to choose among Nash 
equilibria. These criteria are known as refinements of the concept of Nash equilibrium, 
and we will investigate a few of them below. 

5.5.1 Elimination of dominated strategies 

When there is no dominant strategy equilibrium, we have to resort to the idea of a Nash 
equilibrium. But typically there will be more than one Nash equilibrium. Our problem 
then is to try to eliminate some of the Nash equilibria as being "unreasonable." 

One sensible belief to have about players' behavior is that it would be unreasonable 
for them to play strategies that are dominated by other strategies. This suggests that 
when given a game, we should first eliminate all strategies that are dominated and then 
calculate the Nash equilibria of the remaining game. This procedure is called elimination 
of dominated strategies; it can sometimes result in a significant reduction in the number 
of Nash equilibria. 

For example, consider the game given in Table 5.5. Note that there are two pure
strategy Nash equilibria, (Top, Left) and (Bottom, Right). However, the strategy Right
weakly dominates the strategy Left for the Column player. If the Row agent assumes
that Column will never play his dominated strategy, the only equilibrium for the game is
(Bottom, Right).

                      Column
                   Left      Right
    Row   Top      (2,2)     (0,2)
          Bottom   (2,0)     (1,1)

Table 5.5: A Game With Dominated Strategies
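
The procedure can be checked mechanically on Table 5.5. The following is a minimal sketch, not a general solver: the function names `pure_nash` and `weakly_dominated_cols` are illustrative choices, and only Column's strategies are screened for weak dominance because that is the step used in the text.

```python
from itertools import product

# Payoffs of Table 5.5: PAYOFFS[(row_move, col_move)] = (row_payoff, col_payoff).
ROWS = ["Top", "Bottom"]
COLS = ["Left", "Right"]
PAYOFFS = {
    ("Top", "Left"): (2, 2), ("Top", "Right"): (0, 2),
    ("Bottom", "Left"): (2, 0), ("Bottom", "Right"): (1, 1),
}

def pure_nash(rows, cols, payoffs):
    """Return all pure-strategy Nash equilibria of a two-player game."""
    equilibria = []
    for r, c in product(rows, cols):
        # Neither player can gain by a unilateral deviation.
        row_ok = all(payoffs[(r, c)][0] >= payoffs[(r2, c)][0] for r2 in rows)
        col_ok = all(payoffs[(r, c)][1] >= payoffs[(r, c2)][1] for c2 in cols)
        if row_ok and col_ok:
            equilibria.append((r, c))
    return equilibria

def weakly_dominated_cols(rows, cols, payoffs):
    """Return Column strategies weakly dominated by another Column strategy."""
    dominated = []
    for c in cols:
        for d in cols:
            if d == c:
                continue
            never_worse = all(payoffs[(r, d)][1] >= payoffs[(r, c)][1] for r in rows)
            sometimes_better = any(payoffs[(r, d)][1] > payoffs[(r, c)][1] for r in rows)
            if never_worse and sometimes_better:
                dominated.append(c)
                break
    return dominated

all_equilibria = pure_nash(ROWS, COLS, PAYOFFS)
dominated = weakly_dominated_cols(ROWS, COLS, PAYOFFS)
remaining_cols = [c for c in COLS if c not in dominated]
refined_equilibria = pure_nash(ROWS, remaining_cols, PAYOFFS)
```

Here `all_equilibria` contains both (Top, Left) and (Bottom, Right), while `refined_equilibria` keeps only (Bottom, Right) once Left has been removed.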

Elimination of strictly dominated strategies is generally agreed to be an acceptable 
procedure to simplify the analysis of a game. Elimination of weakly dominated strategies 
is more problematic; there are examples in which eliminating weakly dominated strategies 
appears to change the strategic nature of the game in a significant way. 

5.5.2 Sequential Games and Subgame Perfect Equilibrium 

The games described so far in this chapter have all had a very simple dynamic structure: 
they were either one-shot games or a repeated sequence of one-shot games. They also 
had a very simple information structure: each player in the game knew the other player's 
payoffs and available strategies, but did not know in advance the other player's actual 
choice of strategies. Another way to say this is that up until now we have restricted our 
attention to games with simultaneous moves. 

But many games of interest do not have this structure. In many situations at least 
some of the choices are made sequentially, and one player may know the other player's 
choice before he has to make his own choice. The analysis of such games is of considerable 
interest to economists since many economic games have this structure: a monopolist gets 
to observe consumer demand behavior before it produces output, or a duopolist may 
observe his opponent's capital investment before making its own output decisions, etc. 
The analysis of such games requires some new concepts. 

Consider, for example, the simple game depicted in Table 5.6. It is easy to verify that
there are two pure strategy Nash equilibria in this game, (Top, Left) and (Bottom, Right).
Implicit in the description of this game is the idea that both players make their choices
simultaneously, without knowledge of the choice that the other player has made. But
suppose that we consider the game in which Row must choose first, and Column gets to
make his choice after observing Row's behavior.

                      Column
                   Left      Right
    Row   Top      (1,9)     (1,9)
          Bottom   (0,0)     (2,1)

Table 5.6: The Payoff Matrix of a Simultaneous-Move Game

In order to describe such a sequential game it is necessary to introduce a new tool, the 
game tree. This is simply a diagram that indicates the choices that each player can make 
at each point in time. The payoffs to each player are indicated at the "leaves" of the tree, 
as in Figure 5.1. This game tree is part of a description of the game in extensive form. 

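[Game tree: Row chooses Top or Bottom, then Column chooses Left or Right. Payoffs at
the leaves: 1,9 and 1,9 after Top; 0,0 and 2,1 after Bottom.]

Figure 5.1: A game tree. This illustrates the payoffs to the previous game where Row
gets to move first.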

The nice thing about the tree diagram of the game is that it indicates the dynamic 
structure of the game — that some choices are made before others. A choice in the game 
corresponds to the choice of a branch of the tree. Once a choice has been made, the 
players are in a subgame consisting of the strategies and payoffs available to them from 
then on. 

It is straightforward to calculate the Nash equilibria in each of the possible subgames,
particularly in this case since the example is so simple. If Row chooses Top, he effectively
chooses the very simple subgame in which Column has the only remaining move. Column 
is indifferent between his two moves, so that Row will definitely end up with a payoff of 
1 if he chooses Top. 

If Row chooses Bottom, it will be optimal for Column to choose Right, which gives a 
payoff of 2 to Row. Since 2 is larger than 1, Row is clearly better off choosing Bottom 
than Top. Hence the sensible equilibrium for this game is (Bottom, Right). This is, of 
course, one of the Nash equilibria in the simultaneous-move game. If Column announces 
that he will choose Right, then Row's optimal response is Bottom, and if Row announces 
that he will choose Bottom then Column's optimal response is Right. 

But what happened to the other equilibrium, (Top, Left)? If Row believes that Column 
will choose Left, then his optimal choice is certainly to choose Top. But why should Row 
believe that Column will actually choose Left? Once Row chooses Bottom, the optimal 
choice in the resulting subgame is for Column to choose Right. A choice of Left at this 
point is not an equilibrium choice in the relevant subgame. 

In this example, only one of the two Nash equilibria satisfies the condition that it is 
not only an overall equilibrium, but also an equilibrium in each of the subgames. A Nash 
equilibrium with this property is known as a subgame perfect equilibrium. 

It is quite easy to calculate subgame-perfect equilibria, at least in the kind of games 
that we have been examining. One simply does a "backwards induction" starting at the 
last move of the game. The player who has the last move has a simple optimization 
problem, with no strategic ramifications, so this is an easy problem to solve. The player 
who makes the second to the last move can look ahead to see how the player with the 
last move will respond to his choices, and so on. The mode of analysis is similar to that 
of dynamic programming. Once the game has been understood through this backwards 
induction, the agents play it going forwards. 
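
Backward induction on the game of Figure 5.1 can be sketched in a few lines. This is an illustrative sketch under stated assumptions: the nested-tuple encoding of the tree and the function names are choices made here, and ties are broken in favour of the first listed move, which the text does not specify (it matters only for Column's choice after Top, where he is indifferent).

```python
# Game tree of Figure 5.1. A leaf is a payoff pair (row_payoff, column_payoff);
# an internal node is (player_index, {move: subtree}) with 0 = Row, 1 = Column.
TREE = (0, {
    "Top": (1, {"Left": (1, 9), "Right": (1, 9)}),
    "Bottom": (1, {"Left": (0, 0), "Right": (2, 1)}),
})

def is_leaf(node):
    """A node is a leaf when its second entry is a payoff, not a dict of branches."""
    return not isinstance(node[1], dict)

def backward_induction(node):
    """Return (payoffs, path) reached when every mover optimizes in each subgame."""
    if is_leaf(node):
        return node, []
    player, branches = node
    best = None
    for move, subtree in branches.items():
        payoffs, path = backward_induction(subtree)
        # Keep the first move that is strictly better for the player moving here.
        if best is None or payoffs[player] > best[0][player]:
            best = (payoffs, [move] + path)
    return best

equilibrium_payoffs, equilibrium_path = backward_induction(TREE)
```

The recursion solves Column's last-move problem first and then lets Row choose between the resulting continuation payoffs, which reproduces the outcome (Bottom, Right) with payoffs (2, 1).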

The extensive form of the game is also capable of modelling situations where some of 
the moves are sequential and some are simultaneous. The necessary concept is that of 
an information set. The information set of an agent is the set of all nodes of the tree 
that cannot be differentiated by the agent. For example, the simultaneous-move game 
depicted at the beginning of this section can be represented by the game tree in Figure
5.2. In this figure, the shaded area indicates that Column cannot differentiate which of
these decisions Row made at the time when Column must make his own decision. Hence, 
it is just as if the choices are made simultaneously. 



Figure 5.2: Information set. This is the extensive form of the original simultaneous-move
game. The shaded information set indicates that Column is not aware of which choice
Row made when he makes his own decision.



Thus the extensive form of a game can be used to model everything in the strategic 
form plus information about the sequence of choices and information sets. In this sense the 
extensive form is a more powerful concept than the strategic form, since it contains more 
detailed information about the strategic interactions of the agents. It is the presence 
of this additional information that helps to eliminate some of the Nash equilibria as 
"unreasonable." 

Example 5.5.1 (A Simple Bargaining Model) Two players, A and B, have $1 to 
divide between them. They agree to spend at most three days negotiating over the 
division. The first day, A will make an offer, B either accepts or comes back with a 
counteroffer the next day, and on the third day A gets to make one final offer. If they 
cannot reach an agreement in three days, both players get zero. 

A and B differ in their degree of impatience: A discounts payoffs in the future at a
rate of $\alpha$ per day, and B discounts payoffs at a rate of $\beta$ per day. Finally, we assume that
if a player is indifferent between two offers, he will accept the one that is most preferred
by his opponent. The idea is that the opponent could offer some arbitrarily small amount
that would make the player strictly prefer one choice, and that this assumption allows us
to approximate such an "arbitrarily small amount" by zero. It turns out that there is a
unique subgame perfect equilibrium of this bargaining game.

As suggested above, we start our analysis at the end of the game, right before the last 
day. At this point A can make a take-it-or-leave-it offer to B. Clearly, the optimal thing 
for A to do at this point is to offer B the smallest possible amount that he would accept, 
which, by assumption, is zero. So if the game actually lasts three days, A would get 1 
and B would get zero (i.e., an arbitrarily small amount). 

Now go back to the previous move, when B gets to propose a division. At this point
B should realize that A can guarantee himself 1 on the next move by simply rejecting B's
offer. A dollar next period is worth $\alpha$ to A this period, so any offer less than $\alpha$ would
be sure to be rejected. B certainly prefers $1 - \alpha$ now to zero next period, so he should
rationally offer $\alpha$ to A, which A will then accept. So if the game ends on the second move,
A gets $\alpha$ and B gets $1 - \alpha$.

Now move to the first day. At this point A gets to make the offer and he realizes that
B can get $1 - \alpha$ if he simply waits until the second day. Hence A must offer a payoff that
has at least this present value to B in order to avoid delay. Thus he offers $\beta(1 - \alpha)$ to
B. B finds this (just) acceptable and the game ends. The final outcome is that the game
ends on the first move with A receiving $1 - \beta(1 - \alpha)$ and B receiving $\beta(1 - \alpha)$.



Figure 5.3: A bargaining game. The heavy line connects the equilibrium outcomes
in the subgames. The point on the outermost line is the subgame-perfect equilibrium.

Figure 5.3 illustrates this process for the case where $\alpha = \beta < 1$. The outermost
diagonal line shows the possible payoff patterns on the first day, namely all payoffs of
the form $x_A + x_B = 1$. The next diagonal line moving towards the origin shows the
present value of the payoffs if the game ends in the second period: $x_A + x_B = \alpha$. The
diagonal line closest to the origin shows the present value of the payoffs if the game ends
in the third period; the equation for this line is $x_A + x_B = \alpha^2$. The right-angled path
depicts the minimum acceptable divisions each period, leading up to the final subgame
perfect equilibrium. Figure 5.3B shows how the same process looks with more stages in
the negotiation.

It is natural to let the horizon go to infinity and ask what happens in the infinite
game. It turns out that the subgame perfect equilibrium division is

$$\text{payoff to A} = \frac{1 - \beta}{1 - \alpha\beta} \quad \text{and} \quad \text{payoff to B} = \frac{\beta(1 - \alpha)}{1 - \alpha\beta}.$$

Note that if $\alpha = 1$ and $\beta < 1$, then player A receives the
entire payoff, in accord with the principle expressed in the Gospels: "Let patience have
her [subgame] perfect work." (James 1:4).
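
The backward-induction argument of this example extends to any finite number of days. Below is a hedged sketch: the function name `bargaining_split` and the convention that A proposes on odd days and B on even days are assumptions made for the illustration, generalizing the three-day game in the text.

```python
def bargaining_split(alpha, beta, days):
    """Return (share_A, share_B) agreed on day 1 of a `days`-day game.

    alpha and beta are the values today of one dollar tomorrow to A and B.
    A proposes on odd days and B on even days (an assumption of this sketch).
    """
    # On the last day the proposer makes a take-it-or-leave-it offer and keeps 1.
    proposer_share = 1.0
    for day in range(days - 1, 0, -1):
        # Today's responder is tomorrow's proposer; he accepts any offer worth
        # at least the discounted value of what he would get by waiting.
        responder_discount = alpha if (day + 1) % 2 == 1 else beta
        proposer_share = 1.0 - responder_discount * proposer_share
    # A is the proposer on day 1.
    return proposer_share, 1.0 - proposer_share

three_day = bargaining_split(0.9, 0.8, 3)       # the game analyzed in the text
long_horizon = bargaining_split(0.9, 0.8, 201)  # a long but finite horizon
```

With a three-day horizon A's share equals $1 - \beta(1 - \alpha)$, and as the number of days grows it approaches $(1 - \beta)/(1 - \alpha\beta)$, the standard infinite-horizon division for this alternating-offer game.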

5.5.3 Repeated games and subgame perfection 

The idea of subgame perfection eliminates Nash equilibria that involve players threatening 
actions that are not credible - i.e., they are not in the interest of the players to carry 
out. For example, the Punishment Strategy described earlier is not a subgame perfect 
equilibrium. If one player actually deviates from the (Cooperate, Cooperate) path, then it 
is not necessarily in the interest of the other player to actually defect forever in response. 
It may seem reasonable to punish the other player for defection to some degree, but 
punishing forever seems extreme. 

A somewhat less harsh strategy is known as Tit-for-Tat: start out cooperating on the 
first play and on subsequent plays do whatever your opponent did on the previous play. 
In this strategy, a player is punished for defection, but he is only punished once. In this 
sense Tit-for-Tat is a "forgiving" strategy. 
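
A short simulation makes the "punished only once" property concrete. This is only a sketch: the stage payoffs in `STAGE` are hypothetical Prisoner's Dilemma numbers chosen for illustration (they are not taken from the text), and `defect_once` is an illustrative opponent that defects in the second round only.

```python
# Hypothetical Prisoner's Dilemma stage payoffs (illustrative, not from the text).
STAGE = {("C", "C"): (3, 3), ("C", "D"): (0, 4),
         ("D", "C"): (4, 0), ("D", "D"): (1, 1)}

def tit_for_tat(own_history, other_history):
    """Cooperate first, then copy the opponent's previous move."""
    return "C" if not other_history else other_history[-1]

def always_defect(own_history, other_history):
    return "D"

def defect_once(own_history, other_history):
    """Defect in the second round only, cooperate otherwise."""
    return "D" if len(own_history) == 1 else "C"

def play(strategy_1, strategy_2, rounds):
    """Play the repeated game and return both histories and total payoffs."""
    h1, h2 = [], []
    total_1 = total_2 = 0
    for _ in range(rounds):
        m1 = strategy_1(h1, h2)
        m2 = strategy_2(h2, h1)
        p1, p2 = STAGE[(m1, m2)]
        total_1 += p1
        total_2 += p2
        h1.append(m1)
        h2.append(m2)
    return h1, h2, (total_1, total_2)

tft_moves, deviator_moves, totals = play(tit_for_tat, defect_once, 5)
```

Against `defect_once`, Tit-for-Tat defects exactly once (in the round after the deviation) and then returns to cooperation; against `always_defect` it cooperates only in the first round.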

Although the punishment strategy is not subgame perfect for the repeated Prisoner's 
Dilemma, there are strategies that can support the cooperative solution that are subgame 
perfect. These strategies are not easy to describe, but they have the character of the West 
Point honor code: each player agrees to punish the other for defecting, and also punish 
the other for failing to punish another player for defecting, and so on. The fact that you 
will be punished if you don't punish a defector is what makes it subgame perfect to carry 
out the punishments.

Unfortunately, the same sort of strategies can support many other outcomes in the
repeated Prisoner's Dilemma. The Folk Theorem asserts that essentially all distributions 
of utility in a repeated one-shot game can be equilibria of the repeated game. 

This excess supply of equilibria is troubling. In general, the larger the strategy space, 
the more equilibria there will be, since there will be more ways for players to "threaten" 
retaliation for defecting from a given set of strategies. In order to eliminate the
"undesirable" equilibria, we need to find some criterion for eliminating strategies. A natural
criterion is to eliminate strategies that are "too complex." Although some progress has 
been made in this direction, the idea of complexity is an elusive one, and it has been hard 
to come up with an entirely satisfactory definition. 

5.6 Games with incomplete information 
5.6.1 Bayes-Nash Equilibrium 

Up until now we have been investigating games of complete information. In particular, 
each agent has been assumed to know the payoffs of the other player, and each player 
knows that the other agent knows this, etc. In many situations, this is not an appropriate 
assumption. If one agent doesn't know the payoffs of the other agent, then the Nash 
equilibrium doesn't make much sense. However, there is a way of looking at games of 
incomplete information due to Harsanyi (1967) that allows for a systematic analysis of 
their properties. 

The key to the Harsanyi approach is to subsume all of the uncertainty that one agent 
may have about another into a variable known as the agent's type. For example, one 
agent may be uncertain about another agent's valuation of some good, about his or her 
risk aversion and so on. Each type of player is regarded as a different player and each 
agent has some prior probability distribution defined over the different types of agents. 

A Bayes-Nash equilibrium of this game is then a set of strategies for each type 
of player that maximizes the expected value of each type of player, given the strategies 
pursued by the other players. This is essentially the same definition as in the definition 
of Nash equilibrium, except for the additional uncertainty involved about the type of the 
other player. Each player knows that the other player is chosen from a set of possible
types, but doesn't know exactly which one he is playing. Note that, in order to have a complete
description of an equilibrium we must have a list of strategies for all types of players, not 
just the actual types in a particular situation, since each individual player doesn't know 
the actual types of the other players and has to consider all possibilities. 

In a simultaneous-move game, this definition of equilibrium is adequate. In a sequential 
game it is reasonable to allow the players to update their beliefs about the types of the 
other players based on the actions they have observed. Normally, we assume that this 
updating is done in a manner consistent with Bayes' rule. Thus, if one player observes 
the other choosing some strategy s, the first player should revise his beliefs about what 
type the other player is by determining how likely it is that s would be chosen by the 
various types. 
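
The belief revision described here is an application of Bayes' rule. The sketch below is illustrative: the type labels, the prior, and the probabilities with which each type plays strategy s are hypothetical numbers, and `bayes_update` is a name chosen for the example.

```python
def bayes_update(prior, likelihood, observed):
    """Posterior over types after observing an action.

    prior:      {type: probability of that type}
    likelihood: {type: {action: probability that this type plays the action}}
    """
    joint = {t: prior[t] * likelihood[t].get(observed, 0.0) for t in prior}
    total = sum(joint.values())
    if total == 0:
        # Bayes' rule is silent about actions that no type is expected to play.
        raise ValueError("observed action has zero probability under the prior")
    return {t: p / total for t, p in joint.items()}

# Hypothetical numbers: two equally likely types; the "high" type plays s more often.
prior = {"high": 0.5, "low": 0.5}
likelihood = {"high": {"s": 0.9, "t": 0.1}, "low": {"s": 0.3, "t": 0.7}}
posterior = bayes_update(prior, likelihood, "s")
```

After observing s, the weight on the "high" type rises from 0.5 to 0.45/(0.45 + 0.15) = 0.75.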

Example 5.6.1 (A sealed-bid auction) Consider a simple sealed-bid auction for an 
item in which there are two bidders. Each player makes an independent bid without 
knowing the other player's bid and the item will be awarded to the person with the 
highest bid. Each bidder knows his own valuation of the item being auctioned, v, but 
neither knows the other's valuation. However, each player believes that the other person's 
valuation of the item is uniformly distributed between 0 and 1. (And each person knows
that each person believes this, etc.) 

In this game, the type of the player is simply his valuation. Therefore, a Bayes-Nash 
equilibrium to this game will be a function, b(v), that indicates the optimal bid, b, for a 
player of type v. Given the symmetric nature of the game, we look for an equilibrium 
where each player follows an identical strategy. 

It is natural to guess that the function b(v) is strictly increasing; i.e., higher valuations 
lead to higher bids. Therefore, we can let V(b) be its inverse function so that V(b) gives 
us the valuation of someone who bids b. When one player bids some particular b, his 
probability of winning is the probability that the other player's bid is less than b. But 
this is simply the probability that the other player's valuation is less than V(b). Since v is 
uniformly distributed between 0 and 1, the probability that the other player's valuation
is less than V(b) is V(b). 

Hence, if a player bids b when his valuation is v, his expected payoff is

$$(v - b)V(b) + 0 \cdot [1 - V(b)].$$

The first term is the expected consumer's surplus if he has the highest bid; the second
term is the zero surplus he receives if he is outbid. The optimal bid must maximize this
expression, so

$$(v - b)V'(b) - V(b) = 0.$$

For each value of v, this equation determines the optimal bid for the player as a function
of v. Since V(b) is by hypothesis the function that describes the relationship between the
optimal bid and the valuation, we must have

$$(V(b) - b)V'(b) = V(b)$$

for all b.

The solution to this differential equation is

$$V(b) = b + \sqrt{b^2 + 2C},$$

where C is a constant of integration. In order to determine this constant of integration
we note that when v = 0 we must have b = 0, since the optimal bid when the valuation
is zero must be 0. Substituting this into the solution to the differential equation gives us

$$0 = 0 + \sqrt{2C},$$

which implies C = 0. It follows that V(b) = 2b, or b = v/2, is a Bayes-Nash equilibrium
for the simple auction. That is, it is a Bayes-Nash equilibrium for each player to bid half
of his valuation.
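
The equilibrium can be checked numerically: if the rival bids half of his valuation, bidding half of one's own valuation should be a best response. The following is a rough sketch using a grid search; the function names and the grid resolution are choices made for the illustration.

```python
def expected_payoff(v, b):
    """Expected payoff of bidding b with valuation v against a rival who bids v/2."""
    # The rival's valuation is uniform on [0, 1], so his bid is uniform on
    # [0, 1/2] and lies below b with probability min(2b, 1).
    win_probability = min(2.0 * b, 1.0)
    return (v - b) * win_probability

def best_bid(v, steps=1000):
    """Grid search for the payoff-maximizing bid on [0, 1]."""
    grid = [i / steps for i in range(steps + 1)]
    return max(grid, key=lambda b: expected_payoff(v, b))

best_responses = {v: best_bid(v) for v in (0.2, 0.5, 0.8)}
```

Up to the grid resolution, each best response equals v/2, consistent with the solution V(b) = 2b derived above.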

The way that we arrived at the solution to this game is reasonably standard. Essentially,
we guessed that the optimal bidding function was invertible and then derived the
differential equation that it must satisfy. As it turned out, the resulting bid function had 
the desired property. One weakness of this approach is that it only exhibits one particular 
equilibrium to the Bayesian game — there could in principle be many others. 

As it happens, in this particular game, the solution that we calculated is unique, but 
this need not happen in general. In particular, in games of incomplete information it may 
well pay for some players to try to hide their true type. For example, one type may try
to play the same strategy as some other type. In this situation the function relating type
to strategy is not invertible and the analysis is much more complex. 

5.6.2 Discussion of Bayesian-Nash equilibrium 

The idea of Bayesian-Nash equilibrium is an ingenious one, but perhaps it is too ingenious. 
The problem is that the reasoning involved in computing Bayesian-Nash equilibria is often 
very involved. Although it is perhaps not unreasonable that purely rational players would 
play according to the Bayesian-Nash theory, there is considerable doubt about whether 
real players are able to make the necessary calculations. 

In addition, there is a problem with the predictions of the model. The choice that each 
player makes depends crucially on his beliefs about the distribution of various types in 
the population. Different beliefs about the frequency of different types lead to different
optimal behavior. Since we generally don't observe players' beliefs about the prevalence
of various types of players, we typically won't be able to check the predictions of the 
model. Ledyard (1986) has shown that essentially any pattern of play is a Bayesian-Nash 
equilibrium for some pattern of beliefs. 

Nash equilibrium, in its original formulation, puts a consistency requirement on the
beliefs of the agents: only those beliefs compatible with maximizing behavior were allowed.
But as soon as we allow there to be many types of players with different utility functions, 
this idea loses much of its force. Nearly any pattern of behavior can be consistent with 
some pattern of beliefs.

Chapter 6

Theory of the Market 



6.1 Introduction 

In previous chapters, we studied the behavior of individual consumers and firms, describing
optimal behavior when market prices were fixed and beyond the agent's control.
Here, we explore the consequences of that behavior when consumers and firms come together
in markets. We will consider how the equilibrium price and quantity in a
single market, or in a group of closely related markets, are determined by the actions of
individual agents under different market structures. This equilibrium analysis is called a
partial equilibrium analysis because it focuses on a single market or group of closely
related markets, implicitly assuming that changes in the markets under consideration do
not change the prices of other goods or upset the equilibrium that holds in those markets.
We will treat all markets simultaneously in the general equilibrium theory.

We will concentrate on modelling the market behavior of the firm. How do firms
determine the price at which they will sell their output or the prices at which they are
willing to purchase inputs? We will see that in certain situations "price-taking
behavior" might be a reasonable approximation to optimal behavior, but in other situations
we will have to explore models of the price-setting process. We will first consider the
ideal (benchmark) case of perfect competition. We then turn to the study of settings in
which some agents have market power. These settings include the market structures of pure
monopoly, monopolistic competition, oligopoly, and monopsony.

6.2 The Role of Prices

The key insight of Adam Smith's Wealth of Nations is simple: if an exchange between 
two parties is voluntary, it will not take place unless both believe they will benefit from 
it. How can the same be true for any number of parties and when production is involved? The price
system is the mechanism that performs this task very well without central direction. 

Prices perform three functions in organizing economic activities in a free market economy:

(1) They transmit information about production and consumption. The price
system transmits only the important information and only to the people
who need to know. Thus, it transmits information efficiently.

(2) They provide the right incentives. One of the beauties of a free price system is
that the prices that bring the information also provide an incentive to react
to that information, not only about the demand for output but also about
the most efficient way to produce a product. They provide incentives to
adopt those methods of production that are least costly and thereby use
available resources for the most highly valued purposes.

(3) They determine the distribution of income. They determine who gets
how much of the product. In general, one cannot use prices to transmit
information and provide an incentive to act on that information without using
prices to affect the distribution of income. If what a person gets does not
depend on the price he receives for the services of his resources, what
incentive does he have to seek out information on prices or to act on the
basis of that information?

6.3 Perfect Competition 

Let us start by considering the case of pure competition, in which there are a large number of
independent sellers of some uniform product. In this situation, when each firm sets the
price at which it sells its output, it will have to take into account not only the behavior
of the consumers but also the behavior of the other producers.

6.3.1 Assumptions on Competitive Market

The competitive markets are based on the following assumptions: 

(1) Large number of buyers and sellers — price-taking behavior 

(2) Unrestricted mobility of resources among industries: no artificial barrier 
or impediment to entry or to exit from market. 

(3) Homogeneous product: All the firms in an industry produce an identical
product in the consumers' eyes.

(4) Possession of all relevant information (all relevant information is common
knowledge): Firms and consumers have all the information necessary to
make the correct economic decisions.

6.3.2 The Competitive Firm 

A competitive firm is one that takes the market price of output as being given. 

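Let $\bar{p}$ be the market price. Then the demand curve facing an ideal competitive firm
takes the form

$$D(p) = \begin{cases} 0 & \text{if } p > \bar{p} \\ \text{any amount} & \text{if } p = \bar{p} \\ \infty & \text{if } p < \bar{p} \end{cases}$$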

A competitive firm is free to set whatever price it wants and produce whatever quantity 
it is able to produce. However, if a firm is in a competitive market, it is clear that each 
firm that sells the product must sell it for the same price: for if any firm attempted to 
set its price at a level greater than the market price, it would immediately lose all of its 
customers. If any firm set its price at a level below the market price, all of the consumers 
would immediately come to it. Thus, each firm must take the market price as a given,
exogenous variable when it determines its supply decision.

6.3.3 The Competitive Firm's Short-Run Supply Function 

Since the competitive firm must take the market price as given, its profit maximization 
problem is simple. The firm only needs to choose output level y so as to solve 

$$\max_{y} \; py - c(y) \tag{6.1}$$

where y is the output produced by the firm, p is the price of the product, and c(y) is the
cost function of production.

The first-order condition (in short, FOC) for an interior solution gives:

$$p = c'(y) \equiv MC(y). \tag{6.2}$$

The first-order condition becomes a sufficient condition if the second-order condition (in
short, SOC) is satisfied:

$$c''(y) > 0. \tag{6.3}$$

Taken together, these two conditions determine the supply function of a competitive
firm: at any price p, the firm will supply an amount of output y(p) such that p = c'(y(p)).
Differentiating p = c'(y(p)) with respect to p, we have

$$1 = c''(y(p))\,y'(p) \tag{6.4}$$

and thus, since c''(y) > 0,

$$y'(p) > 0, \tag{6.5}$$

which means the law of supply holds.

Recall that p = c'(y*) is the first-order condition characterizing the optimum only if
y* > 0, that is, only if y* is an interior optimum. It could happen that at a low price a
firm might very well decide to go out of business. For the short-run (in short, SR) case,

$$c(y) = c_v(y) + F, \tag{6.6}$$

where $c_v(y)$ is the variable cost and F is the fixed cost. The firm should produce if

$$p\,y(p) - c_v(y(p)) - F \geq -F, \tag{6.7}$$

and thus we have

$$p \geq \frac{c_v(y(p))}{y(p)} \equiv AVC. \tag{6.8}$$

That is, the necessary condition for the firm to produce a positive amount of output is
that the price is greater than or equal to the average variable cost.

Thus, the supply curve for the competitive firm is in general given by: p = c'(y) if
$p \geq c_v(y(p))/y(p)$, and y = 0 if $p \leq c_v(y(p))/y(p)$. That is, the supply curve coincides with the upward
sloping portion of the marginal cost curve as long as the price covers average variable
cost, and the supply curve is zero if price is less than average variable cost.

Figure 6.1: Firm's supply curve, and AC, AVC, and MC Curves.

Suppose that we have J firms in the market. (For the competitive model to make
sense, J should be rather large.) The industry supply function is simply the sum of all
individual firms' supply functions, so that it is given by

$$y(p) = \sum_{j=1}^{J} y_j(p) \tag{6.9}$$

where $y_j(p)$ is the supply function of firm j for j = 1, ..., J. Since each firm chooses a level
of output where price equals marginal cost, each firm that produces a positive amount
of output must have the same marginal cost. The industry supply function measures the
relationship between industry output and the common marginal cost of producing this output.
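
The firm's supply rule, including the shutdown condition p >= AVC, and its aggregation into an industry supply can be sketched as follows. The variable cost function c_v(y) = y^3 - 2y^2 + 2y is a hypothetical example chosen here because it has a U-shaped average variable cost; it does not come from the text, and all firms are assumed identical.

```python
import math

def firm_supply(p):
    """Short-run supply of one firm with the hypothetical variable cost
    c_v(y) = y**3 - 2*y**2 + 2*y (the fixed cost F is sunk and drops out)."""
    # AVC(y) = y**2 - 2*y + 2 reaches its minimum value 1 at y = 1.
    min_avc = 1.0
    if p < min_avc:
        return 0.0  # price does not cover average variable cost: shut down
    # Solve p = MC(y) = 3*y**2 - 4*y + 2 on the upward-sloping branch of MC.
    return (2.0 + math.sqrt(3.0 * p - 2.0)) / 3.0

def industry_supply(p, num_firms):
    """Industry supply for identical firms: the sum of the individual supplies."""
    return sum(firm_supply(p) for _ in range(num_firms))

supply_low_price = firm_supply(0.5)         # below minimum AVC
supply_at_min_avc = firm_supply(1.0)        # price equals minimum AVC
industry_at_two = industry_supply(2.0, 3)   # three identical firms
```

At p = 2 each firm produces y = 4/3, where MC(y) = 2, so three firms supply 4 units; below p = 1 the supply curve is zero.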

The aggregate (industry) demand function measures the total output demanded at
any price, which is given by

$$x(p) = \sum_{i=1}^{n} x_i(p) \tag{6.10}$$

where $x_i(p)$ is the demand function of consumer i for i = 1, ..., n.

6.3.4 Partial Market Equilibrium

How is the market price determined? It is determined by the requirement that the total
amount of output that the firms wish to supply will be equal to the total amount of output
that the consumers wish to demand. Formally, we have the following definition.

A partial equilibrium price p* is a price where the aggregate quantity demanded equals
the aggregate quantity supplied. That is, it is the solution of the following equation:

$$\sum_{i=1}^{n} x_i(p) = \sum_{j=1}^{J} y_j(p). \tag{6.11}$$

Once this equilibrium price is determined, we can go back to look at the individual 
supply schedules of each firm and determine the firm's level of output, its revenue, and its
profits. In Figure 6.2, we have depicted cost curves for three firms. The first has positive 
profits, the second has zero profits, and the third has negative profits. Even though the 
third firm has negative profits, it may make sense for it to continue to produce as long as 
its revenues cover its variable costs. 



Figure 6.2: Positive, zero, and negative profits.



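Example 6.3.1 Suppose $x(p) = a - bp$ and $c(y) = y^2 + 1$. Since MC(y) = 2y, each firm's
supply function is

$$y_j(p) = \frac{p}{2} \tag{6.12}$$

and thus the industry supply function is

$$y(p) = \frac{Jp}{2}. \tag{6.13}$$

Setting $a - bp = Jp/2$, we have

$$p^* = \frac{a}{b + J/2}.$$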

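Now, for the general case of D(p) and S(p), what happens to the equilibrium price if
the number of firms increases? Let p(J) denote the equilibrium price when there are J
identical firms, each with supply function y(p). From

$$D(p(J)) = J\,y(p(J))$$

we have, differentiating with respect to J,

$$D'(p(J))\,p'(J) = y(p(J)) + J\,y'(p(J))\,p'(J)$$

and thus

$$p'(J) = \frac{y(p)}{D'(p) - J\,y'(p)} < 0,$$

provided that demand is downward sloping (D'(p) < 0), since y'(p) > 0. This
means the equilibrium price decreases when the number of firms increases.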
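
For the linear-demand, quadratic-cost case of Example 6.3.1, the effect of J on the equilibrium price can be computed directly. This is a small sketch with illustrative parameter values a = 10 and b = 1, which are not from the text.

```python
def equilibrium_price(a, b, num_firms):
    """Price solving a - b*p = num_firms * p / 2, as in Example 6.3.1."""
    return a / (b + num_firms / 2.0)

# Illustrative parameters: demand x(p) = 10 - p, each firm supplies p / 2.
a, b = 10.0, 1.0
prices = [equilibrium_price(a, b, J) for J in range(1, 6)]

# Market clearing check for J = 2: demand equals industry supply at p*.
p_star = equilibrium_price(a, b, 2)
excess_demand = (a - b * p_star) - 2 * p_star / 2.0
```

The list `prices` is strictly decreasing in J, in line with the sign of p'(J) derived above.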

6.3.5 Competition in the Long Run

The long-run behavior of a competitive industry is determined by two sorts of effects. The 
first effect is the free entry and exit phenomena so that the profits made by all firms are 
zero. If some firm is making negative profits, we would expect that it would eventually 
have to change its behavior. Conversely, if a firm is making positive profits we would 
expect that this would eventually encourage entry to the industry. If we have an industry 
characterized by free entry and exit, it is clear that in the long run all firms must make the 
same level of profits. As a result, every firm makes zero profit at the long-run competitive 
equilibrium. 



The second influence on the long-run behavior of a competitive industry is that of 
technological adjustment. In the long run, firms will attempt to adjust their fixed factors 
so as to produce the equilibrium level of output in the cheapest way. Suppose for example 
we have a competitive firm with a long-run constant returns-to-scale technology that is 
operating at the position illustrated in Figure 6.3. Then in the long run it clearly pays
the firm to change its fixed factors so as to operate at a point of minimum average cost.
But, if every firm tries to do this, the equilibrium price will certainly change.

Figure 6.3: Long-run adjustment with constant costs.

In the model of entry and exit, the equilibrium number of firms is the largest number
of firms that can break even, so the equilibrium price is the lowest price at which the
firms in the market still break even.

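Example 6.3.2 Let c(y) = y^2 + 1. The break-even level of output can be found by setting

AC(y) = MC(y),

that is, y + 1/y = 2y, so that y = 1 and p = MC(y) = 2.

Suppose the demand is linear: X(p) = a - bp. Each firm's supply is y = p/2 (from
p = MC(y) = 2y), so market clearing with J firms requires a - bp = Jp/2. Then the
equilibrium price will be the smallest p* that satisfies the conditions

$$p^* = \frac{a}{b + J/2} \geq 2.$$

As J increases, the equilibrium price must be closer and closer to 2.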
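The example can be sketched numerically. With c(y) = y^2 + 1 each firm supplies y = p/2, so market clearing with J firms gives p(J) = a/(b + J/2), and the break-even requirement p(J) >= 2 is equivalent to J <= a - 2b. The function name and the sample values of a and b below are assumptions made for illustration.

```python
import math


def long_run_equilibrium(a, b):
    """Return (number of firms, price) for demand X(p) = a - b*p.

    Assumes identical firms with c(y) = y**2 + 1, so each supplies y = p/2
    and market clearing gives p(J) = a / (b + J/2).
    """
    # p(J) >= 2 is equivalent to J <= a - 2*b, so take the largest such integer.
    num_firms = math.floor(a - 2 * b)
    if num_firms < 1:
        raise ValueError("demand is too small for any firm to break even")
    price = a / (b + num_firms / 2)
    return num_firms, price


# Illustrative values: the price stays at or just above the break-even price 2.
for a in (10, 10.5, 50.5):
    firms, price = long_run_equilibrium(a, b=1)
    print(f"a={a}: J={firms}, p*={price:.4f}")
```

When a - 2b happens to be an integer the price equals 2 exactly; otherwise it sits slightly above 2, and the gap shrinks as the market (and hence J) grows.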



6.4 Pure Monopoly 

6.4.1 Profit Maximization Problem of Monopolist 

At the opposite pole from pure competition we have the case of pure monopoly. Here 
instead of a large number of independent sellers of some uniform product, we have only 
one seller. A monopolistic firm must make two sorts of decisions: how much output it 
should produce, and at what price it should sell this output. Of course, it cannot make 
these decisions unilaterally. The amount of output that the firm is able to sell will depend 
on the price that it sets. We summarize this relationship between demand and price in a 
market demand function for output, y(p). The market demand function tells how much 
output consumers will demand as a function of the price that the monopolist charges. It 
is often more convenient to consider the inverse demand function p(y), which indicates 
the price that consumers are willing to pay for y amount of output. We have provided 
the conditions under which the inverse demand function exists in Chapter 2. The revenue 
that the firm receives will depend on the amount of output it chooses to supply. We write 
this revenue function as R(y) = p(y)y.

The cost function of the firm also depends on the amount of output produced. This
relationship was extensively studied in Chapter 3. Here we take the factor prices as
constant so that the conditional cost function can be written only as a function of the
level of output of the firm.

The profit maximization problem of the firm can then be written as:

$$\max_{y} \; R(y) - c(y) = \max_{y} \; p(y)\,y - c(y).$$

The first-order conditions for profit maximization are that marginal revenue equals marginal
cost, or

$$p(y^*) + p'(y^*)\,y^* = c'(y^*).$$

The intuition behind this condition is fairly clear. If the monopolist considers producing 
one extra unit of output he will increase his revenue by p(y*) dollars in the first instance. 
But this increased level of output will force the price down by p'(y*), and he will lose this
much revenue on each unit of output sold. The sum of these two effects gives the marginal
revenue. If the marginal revenue exceeds the marginal cost of production the monopolist 
will expand output. The expansion stops when the marginal revenue and the marginal 
cost balance out. 

The first-order conditions for profit maximization can be expressed in a slightly different
manner through the use of the price elasticity of demand. The price elasticity of
demand is given by:

$$\epsilon(y) = \frac{p}{y(p)} \frac{dy(p)}{dp}.$$

Note that this is always a negative number since dy(p)/dp is negative.

Simple algebra shows that the marginal revenue equals marginal cost condition can
be written as:

$$p(y^*)\left[1 + \frac{y^*}{p(y^*)} \frac{dp(y^*)}{dy}\right] = p(y^*)\left[1 + \frac{1}{\epsilon(y^*)}\right] = c'(y^*),$$

that is, the price charged by a monopolist is a markup over marginal cost, with the level
of the markup being given as a function of the price elasticity of demand.
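The markup rule can be turned into a one-line calculation. The sketch below solves p[1 + 1/e] = c' for p, treating the marginal cost and the elasticity at the optimum as given numbers; the function name and the sample values are assumptions used only for illustration.

```python
def monopoly_price(marginal_cost, elasticity):
    """Price implied by p * (1 + 1/elasticity) = marginal cost.

    `elasticity` is the (negative) price elasticity of demand at the optimum.
    With positive marginal cost the condition can only hold where demand is
    elastic, that is, where elasticity < -1.
    """
    if elasticity >= -1:
        raise ValueError("marginal revenue is not positive unless elasticity < -1")
    return marginal_cost / (1 + 1 / elasticity)


# Illustrative values: the less elastic the demand, the larger the markup.
print(monopoly_price(10, -2))   # price is twice marginal cost
print(monopoly_price(10, -4))   # price is 4/3 of marginal cost
```

As the elasticity tends to minus infinity the price approaches marginal cost, which is the competitive case.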

There is also a nice graphical illustration of the profit maximization condition. Suppose
for simplicity that we have a linear inverse demand curve: p(y) = a - by. Then the revenue
function is R(y) = ay - by^2, and the marginal revenue function is just R'(y) = a - 2by.
The marginal revenue curve has the same vertical intercept as the demand curve but is
twice as steep. We have illustrated these two curves in Figure 6.4, along with the average
cost and marginal cost curves of the firm in question.

Figure 6.4: Determination of profit-maximizing monopolist's price and output.

The optimal level of output is located where the marginal revenue and the marginal 
cost curves intersect. This optimal level of output sells at a price p(y*) so the monopolist 
gets an optimal revenue of p(y*)y*. The cost of producing y* is just y* times the average 
cost of production at that level of output. The difference between these two areas gives 
us a measure of the monopolist's profits. 
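The intersection of the marginal revenue and marginal cost curves can also be located numerically. The following is a minimal sketch using bisection; the inverse demand p(y) = 12 - y, the cost function c(y) = y^2, and the function name are assumptions chosen for illustration, not taken from Figure 6.4.

```python
def solve_mr_equals_mc(marginal_revenue, marginal_cost, lo, hi, tol=1e-10):
    """Bisection for the output where marginal revenue equals marginal cost.

    Assumes MR - MC is continuous, positive at `lo` and negative at `hi`.
    """
    def gap(y):
        return marginal_revenue(y) - marginal_cost(y)

    if gap(lo) <= 0 or gap(hi) >= 0:
        raise ValueError("MR - MC must change sign from + to - on [lo, hi]")
    while hi - lo > tol:
        mid = (lo + hi) / 2
        if gap(mid) > 0:
            lo = mid      # expanding output still adds to profit
        else:
            hi = mid
    return (lo + hi) / 2


# Assumed example: p(y) = a - b*y with a = 12, b = 1, and c(y) = y**2.
a, b = 12.0, 1.0
y_star = solve_mr_equals_mc(lambda y: a - 2 * b * y, lambda y: 2 * y, 0.0, a / b)
p_star = a - b * y_star
revenue = p_star * y_star
cost = y_star ** 2            # equals y* times average cost at y*
profit = revenue - cost
```

Here profit is computed as the difference between the revenue rectangle p(y*)y* and the cost rectangle, mirroring the graphical description above.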

6.4.2 Inefficiency of Monopoly 

We say that a situation is Pareto efficient if there is no way to make one agent better off
without making some other agent worse off. Pareto efficiency will be a major theme in the
discussion of welfare economics, but we can give a nice illustration of the concept here.

Let us consider the typical monopolistic configuration illustrated in Figure 6.5. It
turns out that a monopolist always operates in a Pareto inefficient manner. This means that
there is some way to make the monopolist better off without making his customers worse
off.

Figure 6.5: Monopoly results in Pareto inefficient outcome.



To see this, let us think of the monopolist in Figure 6.5 after he has sold $y_m$ of output
at the price $p_m$ and received his monopoly profit. Suppose that the monopolist were
to produce a small additional amount of output $\Delta y$ and offer it to the public. How much
would people be willing to pay for this extra unit? Clearly they would be willing to pay a price of
$p(y_m + \Delta y)$ dollars. How much would it cost to produce this extra output? Clearly, just
the marginal cost $MC(y_m + \Delta y)$. Under this rearrangement the consumers are at least not
worse off since they are freely purchasing the extra unit of output, and the monopolist
is better off since he can sell some extra units at a price that exceeds the cost of their
production. Here we are allowing the monopolist to discriminate in his pricing: he first
sells $y_m$ and then sells more output at some other price.

How long can this process be continued? Once the competitive level of output is 
reached, no further improvements are possible. The competitive level of price and output 
is Pareto efficient for this industry. We will investigate the concept of Pareto efficiency in 
general equilibrium theory. 
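To give a sense of the size of the unexploited gains, the sketch below compares the monopoly and competitive outcomes under an assumed linear inverse demand p(y) = a - by and an assumed constant marginal cost c. The surplus left unrealized at the monopoly output is what is usually called the deadweight loss; that term, the function name, and the parameter values are not from the text above.

```python
def monopoly_vs_competition(a, b, c):
    """Compare outcomes for inverse demand p(y) = a - b*y and constant marginal cost c."""
    if not (a > c >= 0 and b > 0):
        raise ValueError("need a > c >= 0 and b > 0")
    y_m = (a - c) / (2 * b)      # marginal revenue a - 2*b*y equals c
    p_m = a - b * y_m
    y_c = (a - c) / b            # price equals marginal cost
    p_c = c
    # Area between the demand curve and marginal cost from y_m to y_c:
    # the total gain from the extra trades described in the text.
    deadweight_loss = 0.5 * (p_m - c) * (y_c - y_m)
    return {
        "monopoly_output": y_m,
        "monopoly_price": p_m,
        "competitive_output": y_c,
        "competitive_price": p_c,
        "deadweight_loss": deadweight_loss,
    }


result = monopoly_vs_competition(a=10, b=1, c=2)
```

With these illustrative numbers the monopolist sells half the competitive quantity, and every unit between the two output levels is valued by consumers at more than it costs to produce.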

6.4.3 Monopoly in the Long Run 

We have seen how the long-run and the short-run behavior of a competitive industry 
may differ because of changes in technology and entry. There are similar effects in a 
monopolized industry. The technological effect is the simplest: the monopolist will choose 
the level of his fixed factors so as to maximize his long-run profits. Thus, he will operate 
where marginal revenue equals long-run marginal cost, and that is all that needs to be
said.

The entry effect is a bit more subtle. Presumably, if the monopolist is earning positive 
profits, other firms would like to enter the industry. If the monopolist is to remain a 
monopolist, there must be some sort of barrier to entry so that a monopolist may make 
positive profits even in the long-run. 

These barriers to entry may be of a legal sort, but often they are due to the fact that 
the monopolist owns some unique factor of production. For example, a firm might own 
a patent on a certain product, or might own a certain secret process for producing some 
item. If the monopoly power of the firm is due to a unique factor, we have to be careful
about how we measure profits.

6.5 Monopolistic Competition 

Recall that we assumed that the demand curve for the monopolist's product depended only 
on the price set by the monopolist. However, this is an extreme case. Most commodities 
have some substitutes and the prices of those substitutes will affect the demand for the 
original commodity. The monopolist sets his price assuming all the producers of other 
commodities will maintain their prices, but of course, this will not be true. The prices set
by the other firms will respond, perhaps indirectly, to the price set by the monopolist
in question. In this section we will consider what happens when several monopolists 
"compete" in setting their prices and output levels. 

We imagine a group of n "monopolists" who sell similar, but not identical products.
The price that consumers are willing to pay for the output of firm i depends on the level
of output of firm i but also on the levels of output of the other firms: we write this inverse
demand function as $p_i(y_i, y)$ where $y = (y_1, \ldots, y_n)$.

Each firm is interested in maximizing profits: that is, each firm i wants to choose
its level of output $y_i$ so as to maximize:

$$p_i(y_i, y)\,y_i - c_i(y_i).$$

Unfortunately, the demand facing the i-th firm depends on what the other firms do. How
is firm i supposed to forecast the other firms' behavior?

We will adopt a very simple behavioral hypothesis: namely, that firm i assumes the
other firms' behavior will be constant. Thus, each firm i will choose its level of output $y_i^*$
so as to satisfy:

$$p_i(y_i^*, y) + \frac{\partial p_i(y_i^*, y)}{\partial y_i}\,y_i^* - c_i'(y_i^*) \leq 0 \quad \text{with equality if } y_i^* > 0.$$

For each combination of operating levels for the firms $y_1, \ldots, y_n$, there will be some
optimal operating level for firm i. We will denote this optimal choice of output by
$Y_i(y_1, \ldots, y_n)$. (Of course the output of firm i is not an argument of this function but it
seems unnecessary to devise a new notation just to reflect that fact.)

In order for the market to be in equilibrium, each firm's forecast about the behavior of
the other firms must be compatible with what the other firms actually do. Thus if
$(y_1^*, \ldots, y_n^*)$ is to be an equilibrium it must satisfy:

$$y_1^* = Y_1(y_1^*, \ldots, y_n^*)$$
$$\vdots$$
$$y_n^* = Y_n(y_1^*, \ldots, y_n^*)$$

that is, $y_1^*$ must be the optimal choice for firm 1 if it assumes the other firms are going to
produce $y_2^*, \ldots, y_n^*$, and so on. Thus a monopolistic competition equilibrium
$y^* = (y_1^*, \ldots, y_n^*)$ must satisfy:

$$p_i(y_i^*, y^*) + \frac{\partial p_i(y_i^*, y^*)}{\partial y_i}\,y_i^* - c_i'(y_i^*) \leq 0 \quad \text{with equality if } y_i^* > 0,$$

for i = 1, ..., n.

For each firm, its marginal revenue equals its marginal cost, given the actions of all
the other firms. This is illustrated in Figure 6.6. Now, at the monopolistic competition
equilibrium depicted in Figure 6.6, firm i is making positive profits. We would therefore
expect other firms to enter the industry and share the market with the firm, so the firm's
profit will decrease because of close substitute goods. Thus, in the long run, firms would
enter the industry until the profits of each firm were driven to zero. This means that firm
i must charge a price $p_i^*$ and produce an output $y_i^*$ such that:

$$p_i^* y_i^* - c_i(y_i^*) \leq 0 \quad \text{with equality if } y_i^* > 0,$$

or

$$p_i^* - \frac{c_i(y_i^*)}{y_i^*} = 0 \quad \text{if } y_i^* > 0, \qquad i = 1, 2, \ldots, n.$$

Figure 6.6: Short-run monopolistic competition equilibrium

Thus, the price must equal average cost and lie on the demand curve facing the firm.
As a result, as long as the demand curve facing each firm has some negative slope, each
firm will produce at a point where average cost is greater than minimum average
cost. Thus, as with purely competitive firms, the profits made by each firm are zero, and
the outcome is very nearly the long-run competitive equilibrium. On the other hand, as with
a pure monopolist, it still results in an inefficient allocation as long as the demand curve
facing the firm has a negative slope.
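A small parametric sketch may help fix ideas about the long-run outcome. It assumes a symmetric linear specification that is not in the text: firm i faces inverse demand p_i = a - b y_i - d (sum of rivals' outputs), an active firm has cost F + c y_i, and the number of firms is treated as a continuous variable. All names and numbers are illustrative.

```python
import math


def long_run_symmetric_equilibrium(a, b, d, c, fixed_cost):
    """Zero-profit symmetric outcome under the assumed linear specification.

    Inverse demand of firm i:  p_i = a - b*y_i - d*(sum of rivals' outputs)
    Cost of an active firm:    fixed_cost + c*y_i
    """
    # Firm i's first-order condition gives p_i - c = b*y_i, so its profit is
    # b*y_i**2 - fixed_cost; zero profit pins down output per firm.
    output = math.sqrt(fixed_cost / b)
    price = c + b * output
    # Symmetric first-order condition: a - c = (2*b + d*(n - 1)) * output.
    num_firms = 1 + ((a - c) / output - 2 * b) / d
    if num_firms < 1:
        raise ValueError("parameters do not support even one active firm")
    average_cost = c + fixed_cost / output
    return output, price, num_firms, average_cost


output, price, num_firms, average_cost = long_run_symmetric_equilibrium(
    a=10, b=1, d=0.5, c=2, fixed_cost=1
)
```

In this specification price equals average cost at the zero-profit output, while average cost c + F/y is still falling there, which is the sense in which each firm produces above minimum average cost.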



6.6 Oligopoly 

Oligopoly is the study of market interactions with a small number of firms. Such an 
industry usually does not exhibit the characteristics of perfect competition, since individ- 
ual firms' actions can in fact influence market price and the actions of other firms. The 
modern study of this subject is grounded almost entirely in the theory of games discussed 
in the last chapter. Many of the specifications of strategic market interactions have been 
clarified by the concepts of game theory. We now investigate oligopoly theory primarily 
from this perspective by introducing four models.

6.6.1 Cournot Oligopoly

A fundamental model for the analysis of oligopoly is the Cournot oligopoly model,
proposed by Cournot, a French economist, in 1838. A Cournot equilibrium,
already mentioned in the last chapter, is a special set of production levels that have the 
property that no individual firm has an incentive to change its own production level if 
other firms do not change theirs. 

To formalize this equilibrium concept, suppose there are J firms producing a single
homogeneous product. If firm j produces output level $q_j$, the firm's cost is $c_j(q_j)$. There
is a single market inverse demand function p(q). The total supply is $q = \sum_{j=1}^{J} q_j$. The
profit to firm j is

$$p(q)\,q_j - c_j(q_j).$$

Definition 6.6.1 (Cournot Equilibrium) A set of output levels $q_1, q_2, \ldots, q_J$ constitutes
a Cournot equilibrium if for each j = 1, 2, ..., J the profit to firm j cannot be
increased by changing $q_j$ alone.

Accordingly, the Cournot model can be regarded as a one-shot game: the profit of firm
j is its payoff, the strategy space of firm j is the set of outputs, and thus a Cournot
equilibrium is just a pure strategy Nash equilibrium. The first-order conditions for
an interior optimum are:

$$p'(q)\,q_j + p(q) - c_j'(q_j) = 0, \qquad j = 1, 2, \ldots, J.$$

The first-order condition for firm j determines firm j's optimal choice of output as a function
of its beliefs about the sum of the other firms' outputs, denoted by $q_{-j}$; i.e., the first-order
condition can be written as

$$p'(q_j + q_{-j})\,q_j + p(q_j + q_{-j}) - c_j'(q_j) = 0, \qquad j = 1, 2, \ldots, J.$$

The solution to the above equation, denoted by $Q_j(q_{-j})$, is called the reaction function
to the total outputs produced by the other firms.

Reaction functions give a direct characterization of a Cournot equilibrium. A set of
output levels $q_1, q_2, \ldots, q_J$ constitutes a Cournot equilibrium if each output is given by
the corresponding reaction function: $q_j = Q_j(q_{-j})$ for j = 1, 2, ..., J.
An important special case is that of duopoly, an industry with just two firms. In this
case, the reaction function of each firm is a function of just the other firm's output. Thus,
the two reaction functions have the form $Q_1(q_2)$ and $Q_2(q_1)$, which are shown in Figure
6.7. In the figure, if firm 1 selects a value $q_1$ on the horizontal axis, firm 2 will react by
selecting the point on the vertical axis that corresponds to the function $Q_2(q_1)$. Similarly,
if firm 2 selects a value $q_2$ on the vertical axis, firm 1 will react by selecting the
point on the horizontal axis that corresponds to the curve $Q_1(q_2)$. The equilibrium point
corresponds to the point of intersection of the two reaction functions.

Figure 6.7: Reaction functions.

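The intersection of the two reaction functions can be found by letting the firms respond to each other in turn. The sketch below does this for an assumed linear inverse demand p(q) = a - bq and assumed constant marginal costs, for which the reaction function is Q_j(q_-j) = (a - c_j - b q_-j) / (2b), truncated at zero. Function names and parameter values are illustrative only.

```python
def reaction(a, b, c_own, q_other):
    """Best response to the rival's output under p(q) = a - b*q and constant marginal cost."""
    return max(0.0, (a - c_own - b * q_other) / (2 * b))


def cournot_duopoly(a, b, c1, c2, tol=1e-12, max_iter=10_000):
    """Iterate the two reaction functions until outputs stop changing."""
    q1 = q2 = 0.0
    for _ in range(max_iter):
        new_q1 = reaction(a, b, c1, q2)
        new_q2 = reaction(a, b, c2, new_q1)
        if abs(new_q1 - q1) < tol and abs(new_q2 - q2) < tol:
            return new_q1, new_q2
        q1, q2 = new_q1, new_q2
    raise RuntimeError("best-response iteration did not converge")


# Symmetric costs give equal outputs; a cost disadvantage shrinks firm 2's share.
print(cournot_duopoly(a=10, b=1, c1=1, c2=1))
print(cournot_duopoly(a=10, b=1, c1=1, c2=4))
```

For this linear duopoly the iteration converges because each reaction function has slope -1/2; with other demand or cost functions convergence is not guaranteed, and the iteration should be read as a way of locating the intersection rather than as a claim about how firms actually adjust.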
6.6.2 Stackelberg Model 

There are alternative methods for characterizing the outcome of an oligopoly. One of the 
most popular of these is that of quantity leadership, also known as the Stackelberg model. 

Consider the special case of a duopoly. In the Stackelberg formulation one firm, say 
firm 1, is considered to be the leader and the other, firm 2, is the follower. The leader may, 
for example, be the larger firm or may have better information. If there is a well-defined 
order for firms committing to an output decision, the leader commits first. 

Given the committed production level $q_1$ of firm 1, firm 2, the follower, will select $q_2$
using the same reaction function as in the Cournot theory. That is, firm 2 finds $q_2$ to
maximize

$$\pi_2 = p(q_1 + q_2)\,q_2 - c_2(q_2),$$

where $p(q_1 + q_2)$ is the industrywide inverse demand function. This yields the reaction
function $Q_2(q_1)$.

Firm 1, the leader, accounts for the reaction of firm 2 when originally selecting $q_1$. In
particular, firm 1 selects $q_1$ to maximize

$$\pi_1 = p(q_1 + Q_2(q_1))\,q_1 - c_1(q_1).$$

That is, firm 1 substitutes $Q_2(q_1)$ for $q_2$ in the profit expression.

Note that a Stackelberg equilibrium does not yield a system of equations that must be
solved simultaneously. Once the reaction function of firm 2 is found, firm 1's problem can
be solved directly. Usually, the leader will do better in a Stackelberg equilibrium than in 
a Cournot equilibrium. 
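The two-step logic can be sketched for an assumed linear case: inverse demand p(q) = a - bq and constant marginal costs c1 and c2. Substituting the follower's reaction Q2(q1) = (a - c2 - b q1) / (2b) into the leader's profit and maximizing gives q1 = (a - 2 c1 + c2) / (2b), provided the follower's output is positive. The names and numbers below are illustrative assumptions.

```python
def follower_reaction(a, b, c2, q1):
    """Follower's best response, the same reaction function as in the Cournot model."""
    return max(0.0, (a - c2 - b * q1) / (2 * b))


def stackelberg(a, b, c1, c2):
    """Leader commits first; assumes the follower's output is positive at the solution."""
    q1 = (a - 2 * c1 + c2) / (2 * b)       # maximizes p(q1 + Q2(q1))*q1 - c1*q1
    q2 = follower_reaction(a, b, c2, q1)
    price = a - b * (q1 + q2)
    return q1, q2, price, (price - c1) * q1, (price - c2) * q2


def cournot_profit_firm1(a, b, c1, c2):
    """Firm 1's profit at the interior Cournot equilibrium, for comparison."""
    q1 = (a - 2 * c1 + c2) / (3 * b)
    q2 = (a - 2 * c2 + c1) / (3 * b)
    price = a - b * (q1 + q2)
    return (price - c1) * q1


q1, q2, price, leader_profit, follower_profit = stackelberg(a=10, b=1, c1=1, c2=1)
```

With these numbers the leader produces more than its Cournot output and earns a higher profit than in the Cournot equilibrium, while the follower produces less and earns less.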

6.6.3 Bertrand Model 

Another model of oligopoly of some interest is the so-called Bertrand model. The Cournot
model and Stackelberg model take the firms' strategy spaces as being quantities, but it
seems equally natural to consider what happens if price is chosen as the relevant strategic
variable. Almost 50 years after Cournot, another French economist, Joseph Bertrand
(1883), offered a different view of firms under imperfect competition, now known as the
Bertrand model of oligopoly. Bertrand argued that it is much more natural to think
of firms competing in their choice of price, rather than quantity. This small difference
completely changes the character of market equilibrium. The result is striking, and it
contrasts starkly with what occurs in the Cournot model: with just two firms in a
market, we obtain a perfectly competitive outcome in the Bertrand model!

In a simple Bertrand duopoly, two firms produce a homogeneous product, each has
identical marginal cost c > 0, and they face a market demand curve D(p) which is continuous
and strictly decreasing at all prices such that D(p) > 0. The strategy of each player is
to announce the price at which he would be willing to supply an arbitrary amount of the
good in question. In this case the payoff function takes a radically different form. It is
plausible to suppose that the consumers will only purchase from the firm with the lowest
price, and that they will split evenly between the two firms if they charge the same price.
This leads to a payoff to firm 1 of the form:

$$\pi_1(p_1, p_2) = \begin{cases} (p_1 - c)\,D(p_1) & \text{if } p_1 < p_2 \\ (p_1 - c)\,D(p_1)/2 & \text{if } p_1 = p_2 \\ 0 & \text{if } p_1 > p_2 \end{cases}$$



Note that the Cournot game and the Bertrand game have a radically different struc- 
ture. In the Cournot game, the payoff to each firm is a continuous function of its strategic 
choice; in the Bertrand game, the payoffs are discontinuous functions of the strategies. 
What is the Nash equilibrium? It may be somewhat surprising, but in the unique Nash
equilibrium, both firms charge a price equal to marginal cost, and both earn zero profit.
Formally, we have

Proposition 6.6.1 There is a unique Nash equilibrium $(p_1, p_2)$ in the Bertrand duopoly.
In this equilibrium, both firms set their price equal to the marginal cost: $p_1 = p_2 = c$ and
earn zero profit.

Proof. First note that both firms setting their prices equal to c is indeed a Nash
equilibrium. Neither firm can gain by raising its price because it will then make no sales
(thereby still earning zero); and by lowering its price below c a firm increases its sales but
incurs losses. What remains is to show that there can be no other Nash equilibrium.
Because each firm i chooses $p_i \geq c$, it suffices to show that there are no equilibria in which
$p_i > c$ for some i. So let $(p_1, p_2)$ be an equilibrium.

If $p_1 > c$, then because $p_2$ maximizes firm 2's profits given firm 1's price choice, we
must have $p_2 \in (c, p_1]$, because some such choice earns firm 2 strictly positive profits,
whereas all other choices earn firm 2 at most zero profits. Moreover, $p_2 \neq p_1$ because if firm
2 can earn positive profits by choosing $p_2 = p_1$ and splitting the market, it can earn
even higher profits by choosing $p_2$ just slightly below $p_1$ and supplying the entire market at
virtually the same price. Therefore, $p_1 > c$ implies that $p_2 > c$ and $p_2 < p_1$. But by
switching the roles of firms 1 and 2, an analogous argument establishes that $p_2 > c$ implies
that $p_1 > c$ and $p_1 < p_2$. Consequently, if one firm's price is above marginal cost, both
prices must be above marginal cost and each firm must be strictly undercutting the other,
which is impossible.

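The undercutting logic of the proof can be illustrated with a small sketch of the Bertrand payoff. The demand function D(p) = max(0, 100 - p), the marginal cost c = 20, the integer price grid, and the function names are all assumptions made for illustration.

```python
def bertrand_payoff(p_own, p_other, c, demand):
    """Payoff of a firm charging p_own when its rival charges p_other."""
    if p_own < p_other:
        return (p_own - c) * demand(p_own)       # serves the whole market
    if p_own == p_other:
        return (p_own - c) * demand(p_own) / 2   # market is split evenly
    return 0.0                                   # makes no sales


def has_profitable_deviation(p1, p2, c, demand, candidate_prices):
    """True if either firm can raise its payoff by switching to some candidate price."""
    base1 = bertrand_payoff(p1, p2, c, demand)
    base2 = bertrand_payoff(p2, p1, c, demand)
    return any(
        bertrand_payoff(p, p2, c, demand) > base1
        or bertrand_payoff(p, p1, c, demand) > base2
        for p in candidate_prices
    )


def demand(p):
    return max(0, 100 - p)


grid = range(0, 101)
at_cost = has_profitable_deviation(20, 20, 20, demand, grid)      # both price at c
above_cost = has_profitable_deviation(30, 30, 20, demand, grid)   # both price above c
```

Pricing at marginal cost survives every deviation on the grid, while a common price of 30 does not, since undercutting to 29 nearly doubles a firm's sales. A discrete grid cannot establish uniqueness: with whole-number prices, both firms charging one tick above c is also immune to deviations, an artifact that disappears when prices are continuous as in the proposition.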
6.6.4 Collusion

All of the models described up until now are examples of non-cooperative games. Each 
firm maximizes its own profits and makes its decisions independently of the other firms. 
What happens if they coordinate their actions? An industry structure where the firms 
collude to some degree in setting their prices and outputs is called a cartel. 

A natural model is to consider what happens if the two firms simultaneously
choose $y_1$ and $y_2$ to maximize industry profits:

$$\max_{y_1, y_2} \; p(y_1 + y_2)(y_1 + y_2) - c_1(y_1) - c_2(y_2).$$

The first-order conditions are

$$p'(y_1^* + y_2^*)(y_1^* + y_2^*) + p(y_1^* + y_2^*) = c_1'(y_1^*)$$
$$p'(y_1^* + y_2^*)(y_1^* + y_2^*) + p(y_1^* + y_2^*) = c_2'(y_2^*).$$

It is easy to see from the above first-order conditions that profit maximization implies

$$c_1'(y_1^*) = c_2'(y_2^*).$$

The problem with the cartel solution is that it is not "stable" unless the firms are completely
merged. There is always a temptation to cheat: to produce more than the agreed-upon
output. Indeed, when the other firm holds its production constant, we have by rearranging
the first-order condition of firm 1:

$$\frac{\partial \pi_1(y_1^*, y_2^*)}{\partial y_1} = p'(y_1^* + y_2^*)\,y_1^* + p(y_1^* + y_2^*) - c_1'(y_1^*) = -p'(y_1^* + y_2^*)\,y_2^* > 0$$

by noting the fact that the demand curve slopes downward.

The strategic situation is similar to the Prisoner's Dilemma: if you think that the other
firm will produce its quota, it pays you to defect and produce more than your quota. And
if you think that the other firm will not produce at its quota, then it will in general be
profitable for you to produce more than your quota.

In order to make the cartel outcome viable, some punishment mechanism should be
provided for cheating on the cartel agreement, say using a repeated game as discussed
in the previous chapter.

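The temptation to cheat can be made concrete with a sketch under assumed functional forms: linear inverse demand p(Y) = a - bY, identical constant marginal cost c, and an equal split of the cartel output between the two firms. None of these specifics come from the text, and the names are illustrative.

```python
def cartel_and_cheating(a, b, c):
    """Cartel quotas and the payoff from unilaterally deviating from them."""
    total = (a - c) / (2 * b)          # industry output maximizing joint profit
    quota = total / 2                  # assumed equal split between the firms
    price = a - b * total
    quota_profit = (price - c) * quota
    # Firm 1's marginal profit at the quotas: p'(Y)*y1 + p(Y) - c.
    marginal_gain = -b * quota + price - c
    # Firm 1's best response if firm 2 keeps producing its quota.
    cheat_output = (a - c - b * quota) / (2 * b)
    cheat_price = a - b * (cheat_output + quota)
    return {
        "quota": quota,
        "cartel_price": price,
        "quota_profit": quota_profit,
        "marginal_gain": marginal_gain,
        "cheat_output": cheat_output,
        "cheat_profit": (cheat_price - c) * cheat_output,
        "victim_profit": (cheat_price - c) * quota,
    }


outcome = cartel_and_cheating(a=10, b=1, c=2)
```

The marginal gain equals b times the other firm's quota, which is the expression -p'(y1* + y2*) y2* in this linear case. The deviating firm gains while the firm that keeps to its quota loses, so joint profit falls.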
6.7 Monopsony



In our discussion up until now we have generally assumed that we have been dealing 
with the market for an output good. Output markets can be classified as "competitive" 
or "monopolistic" depending on whether firms take the market price as given or whether 
firms take the demand behavior of consumers as given. 

There is a similar classification for input markets. If firms take the factor prices as given,
then we have competitive factor markets. If instead there is only one firm which demands
some factor of production and it takes the supply behavior of its suppliers into account,
then we say we have a monopsonistic factor market. The behavior of a monopsonist is
analogous to that of a monopolist. Let us consider a simple example of a firm that is a
competitor in its output market but is the sole purchaser of some input good. Let w(x)
be the (inverse) supply curve of this factor of production. Then the profit maximization
problem is:

$$\max_{x} \; p f(x) - w(x)\,x.$$

The first-order condition is:

$$p f'(x^*) - w(x^*) - w'(x^*)\,x^* = 0.$$

This just says that the marginal revenue product of an additional unit of the input good
should be equal to its marginal cost. Another way to write the condition is:

$$p \frac{df(x^*)}{dx} = w(x^*)\left[1 + \frac{1}{\epsilon}\right],$$

where $\epsilon$ is the price elasticity of supply. As $\epsilon$ goes to infinity the behavior of a monopsonist
approaches that of a pure competitor.

Recall that in Chapter 3, where we defined the cost function of a firm, we considered
only the behavior of a firm with competitive factor markets. However, it is certainly
possible to define a cost function for a monopsonistic firm. Suppose for example that
$x_i(w)$ is the supply curve for factor i. Then we can define:

$$c(y) = \min_{w} \sum_{i} w_i\,x_i(w) \quad \text{s.t.} \quad f(x(w)) = y.$$

This just gives us the minimum cost of producing y.

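The monopsonist's first-order condition can be checked on a simple parametric case. The sketch assumes a quadratic production function f(x) = alpha x - beta x^2 / 2 and a linear inverse supply curve w(x) = w0 + s x; these forms, the competitive benchmark used for comparison, and all names are assumptions made for illustration.

```python
def monopsony_input_choice(p, alpha, beta, w0, s):
    """Input choice of a monopsonist under the assumed functional forms.

    f(x) = alpha*x - beta*x**2/2   (production function)
    w(x) = w0 + s*x                (inverse factor supply)
    """
    if p * alpha <= w0:
        raise ValueError("the input is never worth hiring at these parameters")
    # First-order condition: p*(alpha - beta*x) = w0 + 2*s*x.
    x_star = (p * alpha - w0) / (p * beta + 2 * s)
    wage = w0 + s * x_star
    marginal_revenue_product = p * (alpha - beta * x_star)
    supply_elasticity = wage / (s * x_star)      # (w/x) * dx/dw
    # Benchmark: a price-taking buyer hires until p*f'(x) = w(x).
    x_competitive = (p * alpha - w0) / (p * beta + s)
    return x_star, wage, marginal_revenue_product, supply_elasticity, x_competitive


x_star, wage, mrp, elasticity, x_comp = monopsony_input_choice(
    p=1, alpha=10, beta=1, w0=2, s=0.5
)
```

At the solution the marginal revenue product equals w(x*)[1 + 1/e], and the monopsonist hires less of the input than the price-taking benchmark.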
Part III

General Equilibrium Theory and Social Welfare

Part III is devoted to an examination of competitive market economies from a general
equilibrium perspective in which all prices are variable and equilibrium requires that all
markets clear.

The content of Part III is organized into four chapters. Chapters 7 and 8 constitute
the heart of the general equilibrium theory. Chapter 7 presents the formal structure of the
equilibrium model and introduces the notion of competitive equilibrium (also called Walrasian
equilibrium). The emphasis is on positive properties of the competitive equilibrium. We
will discuss the existence, uniqueness, and stability of a competitive equilibrium. We will
also discuss a more general setting of equilibrium analysis, namely the abstract economy,
which includes the general equilibrium model as a special case. Chapter 8 discusses
the normative properties of the competitive equilibrium by introducing the notion of
Pareto efficiency. We examine the relationship between the competitive equilibrium and
Pareto optimality. The core of the chapter is concerned with the proof of the two fundamental
theorems of welfare economics. Chapter 9 explores extensions of the basic analysis presented in
Chapters 7 and 8 and covers a number of topics whose origins lie in normative
theory. We will study the important core equivalence theorem, which takes the idea of
Walrasian equilibria as the limit of cooperative equilibria as markets grow large, fairness of
allocation, and social choice theory. Chapter 10 applies the general equilibrium framework
developed in Chapters 7 to 9 to economic situations involving the exchange and allocation
of resources under conditions of uncertainty.

It is gratefully acknowledged that some materials are drawn from the references at the
end of each chapter.

Chapter 7

Positive Theory of Equilibrium:
Existence, Uniqueness, and Stability

7.1 Introduction 

One of the great achievements of economic theory in the last century is general equilibrium 
theory. General equilibrium theory is a branch of theoretical neoclassical economics. 
It seeks to explain the behavior of supply, demand and prices in a whole economy, by 
considering equilibrium in many markets simultaneously, unlike partial equilibrium theory 
which considers only one market at a time. Interaction between markets may result in
a conclusion that is not obtained in a partial equilibrium framework. As such, it is a
benchmark model for studying a market economy and so an abstraction from a real economy;
it is proposed as being a useful model, both by considering equilibrium prices as long-term
prices and by considering actual prices as deviations from equilibrium. The theory dates
to the 1870s, particularly the work of the French economist Léon Walras, and thus it is often
called the Walrasian theory of markets.

From a positive viewpoint, the general equilibrium theory is a theory of the determi- 
nation of equilibrium prices and quantities in a system of perfectly competitive markets. 
The proof of existence of a general equilibrium is generally considered one of the most 
important and robust results of economic theory. The mathematical model of a competitive
economy of L. Walras (1874-77) was conceived as an attempt to explain the state
of equilibrium reached by a large number of small agents interacting through markets.
Walras himself perceived that the theory would be vacuous without a mathematical argument
in support of the existence of at least one equilibrium state. However, for more than
half a century the equality of the number of equations and of the number of unknowns
of the Walrasian model remained the only, unconvincing remark made in favor of the
existence of a competitive equilibrium. Study of the existence problem began in the early
1930s when Neisser (1932), Stackelberg (1933), Zeuthen (1933), and Schlesinger (1935)
identified some of its basic features and when Wald (1935, 1936a, 1936b) obtained its first
solutions. After an interruption of some twenty years, the question of existence of an
economic equilibrium was taken up again by Arrow and Debreu (1954), McKenzie (1954,
1955), Gale (1955), and many others.

Although the first proof of existence of a competitive equilibrium was obtained by
Wald, the work of von Neumann (1928, 1937) turned out to be of greater importance for the
subject. He proved a topological lemma which, in its reformulation by Kakutani (1941) as a
fixed-point theorem for a correspondence, became the most powerful tool for the proof of
existence of an economic equilibrium.

A general equilibrium is defined as a state where the aggregate demand does not
exceed the aggregate supply in all markets. Thus, equilibrium prices are endogenously
determined.

The general equilibrium approach has two central features: 

(1) It views the economy as a closed and inter-related system in which we
must simultaneously determine the equilibrium values of all variables of
interest (consider all markets together).

(2) It aims at reducing the set of variables taken as exogenous to a small
number of physical realities.

Its purpose is to predict the final consumption and production under the market mechanism.

The general equilibrium theory consists of five components:

1. Economic institutional environment (the fundamentals of the economy):
an economy that consists of consumption spaces, preferences, endowments of
consumers, and production possibility sets of producers.

2. Economic institutional arrangement: the price mechanism, in which a
price is quoted for every commodity.

3. The behavior assumptions: price-taking behavior for consumers and firms,
utility maximization and profit maximization.

4. Predicting outcomes: equilibrium analysis, i.e., positive analysis such as
existence, uniqueness, and stability.

5. Evaluating outcomes: normative analysis such as allocative efficiency of
general equilibrium.

Questions to be answered in the general equilibrium theory:

A. The existence and determination of a general equilibrium: What kinds
of restrictions on economic environments (consumption sets, endowments,
preferences, production sets) would guarantee the existence of a general
equilibrium?

B. Uniqueness of a general equilibrium: What kinds of restrictions on economic
environments would guarantee a general equilibrium to be unique?

C. Stability of a general equilibrium: What kinds of restrictions on economic
environments would guarantee that we can find a general equilibrium by changing
prices, especially raising the price if excess demand prevails and lowering it
if excess supply prevails?

D. Welfare properties of a general equilibrium: What kinds of restrictions on
consumption sets, endowments, preferences, and production sets would ensure
that a general equilibrium is socially optimal, that is, Pareto efficient?

7.2 The Structure of General Equilibrium Model 
7.2.1 Economic Environments 

The fundamentals of the economy are economic institutional environments that are exogenously
given and characterized by the following terms:

n: the number of consumers
$N = \{1, \ldots, n\}$: the set of consumers
J: the number of producers (firms)
L: the number of (private) goods

$X_i \subseteq \mathbb{R}^L$: the consumption space of consumer i = 1, ..., n, which specifies the
boundary of consumptions, the collection of all individually feasible consumptions
of consumer i. Some components of an element may be negative, such
as a labor supply;

$\succeq_i$: preference ordering (or $u_i$ if a utility function exists) of consumer i =
1, ..., n;

Remark: $\succeq_i$ is a preference ordering if it is reflexive ($x_i \succeq_i x_i$), transitive
($x_i \succeq_i x_i'$ and $x_i' \succeq_i x_i''$ implies $x_i \succeq_i x_i''$), and complete (for any pair $x_i$ and
$x_i'$, either $x_i \succeq_i x_i'$ or $x_i' \succeq_i x_i$). It can be shown that it can be represented
by a continuous utility function if $\succeq_i$ is continuous. The existence of
general equilibrium can be obtained even when preferences are weakened
to be non-complete or non-transitive.

$w_i \in X_i$: initial endowment vector of consumer i.

$e_i = (X_i, \succeq_i, w_i)$: the characteristic of consumer i.

$Y_j$: production possibility set of firm j = 1, 2, ..., J, which is the characteristic
of producer j.

$y_j \in Y_j$: a production plan; $y_j^l > 0$ means $y_j^l$ is an output and $y_j^l < 0$ means $y_j^l$ is
an input. Most elements of $y_j$ for a firm are zero.

Recall that there can be three types of returns to scale in production:
non-increasing returns to scale (i.e., $y_j \in Y_j$ implies that $\alpha y_j \in Y_j$ for all
$\alpha \in [0, 1]$), non-decreasing returns to scale (i.e., $y_j \in Y_j$ implies that $\alpha y_j \in Y_j$
for all $\alpha \ge 1$), and constant returns to scale (i.e., $y_j \in Y_j$ implies
that $\alpha y_j \in Y_j$ for all $\alpha \ge 0$). In other words, non-increasing returns to scale
means that any feasible input-output vector can be scaled down; non-decreasing
returns to scale means that any feasible input-output vector can be scaled up;
constant returns to scale means that the production set has both
properties. Geometrically, it is a cone.




Figure 7.1: Various Returns to Scale: IRS, DRS, and CRS. 

$e = (\{X_i, \succsim_i, w_i\}, \{Y_j\})$: an economy, also called an economic environment.
$X = X_1 \times X_2 \times \cdots \times X_n$: consumption space.
$Y = Y_1 \times Y_2 \times \cdots \times Y_J$: production space.

7.2.2 Institutional Arrangement: Private Market Mechanism 

$p = (p^1, p^2, \ldots, p^L) \in \mathbb{R}^L_+$: a price vector;

$p x_i$: the expenditure of consumer $i$ for $i = 1, \ldots, n$;

$p y_j$: the profit of firm $j$ for $j = 1, \ldots, J$;

$p w_i$: the value of endowments of consumer $i$ for $i = 1, \ldots, n$;

$\theta_{ij} \in \mathbb{R}_+$: the profit share of consumer $i$ from firm $j$, which specifies ownership
(property rights) structures, so that $\sum_{i=1}^{n} \theta_{ij} = 1$ for $j = 1, 2, \ldots, J$;

$\sum_{j=1}^{J} \theta_{ij}\, p y_j$: the total profit dividend received by consumer $i$ from firms for
$i = 1, \ldots, n$.

For $i = 1, 2, \ldots, n$, consumer $i$'s budget constraint is given by

$$p x_i \le p w_i + \sum_{j=1}^{J} \theta_{ij}\, p y_j \tag{7.1}$$

and the budget set is given by

$$B_i(p) = \Big\{ x_i \in X_i : p x_i \le p w_i + \sum_{j=1}^{J} \theta_{ij}\, p y_j \Big\}. \tag{7.2}$$

A private ownership economy is then referred to as

$$e = (e_1, e_2, \ldots, e_n, \{Y_j\}_{j=1}^{J}, \{\theta_{ij}\}). \tag{7.3}$$

The set of all such private ownership economies is denoted by $E$.
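
As a small numerical illustration of the budget constraint (7.1), the following Python sketch computes a consumer's wealth (endowment value plus profit dividends) and tests membership in the budget set. The function names and all numbers are hypothetical and chosen only for illustration; they are not part of the notes.

```python
from typing import Sequence


def dot(a: Sequence[float], b: Sequence[float]) -> float:
    """Inner product of two vectors of equal length, e.g. p.x."""
    return sum(ai * bi for ai, bi in zip(a, b))


def wealth(p, w_i, theta_i, y):
    """Right-hand side of (7.1): p.w_i + sum_j theta_ij * p.y_j."""
    return dot(p, w_i) + sum(t * dot(p, y_j) for t, y_j in zip(theta_i, y))


def in_budget_set(p, x_i, w_i, theta_i, y, tol=1e-12):
    """True if x_i satisfies the budget constraint (7.1) at prices p."""
    return dot(p, x_i) <= wealth(p, w_i, theta_i, y) + tol


# Hypothetical data: two goods, one firm, consumer 1 owns half of the firm.
p = [1.0, 2.0]
w_1 = [1.0, 2.0]
theta_1 = [0.5]
y = [[-1.0, 1.5]]  # the firm uses 1 unit of good 1 to produce 1.5 units of good 2

income_1 = wealth(p, w_1, theta_1, y)  # 5 from endowment + 0.5 * 2 from profit
```

With these numbers the bundle (2, 2) exhausts the budget exactly, while (3, 2) is not affordable.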

7.2.3 Individual Behavior Assumptions: 

(1) Perfectly competitive markets: Every player is a price-taker.

(2) Utility maximization: Every consumer maximizes his preferences subject
to $B_i(p)$. That is,

$$\max_{x_i} u_i(x_i) \tag{7.4}$$

s.t.

$$p x_i \le p w_i + \sum_{j=1}^{J} \theta_{ij}\, p y_j \tag{7.5}$$

(3) Profit maximization: Every firm maximizes its profit in $Y_j$. That is,

$$\max_{y_j \in Y_j} p y_j \tag{7.6}$$

for $j = 1, \ldots, J$.

7.2.4 Competitive Equilibrium 

Before defining the notion of competitive equilibrium, we first give some notions on
allocations, which identify the set of possible outcomes in economy $e$. For notational
convenience, "a" will be used throughout the notes to denote the sum of the vectors
$a_i$, i.e., $a := \sum_i a_i$.

Allocation:

An allocation $(x, y)$ is a specification of a consumption vector $x = (x_1, \ldots, x_n)$ and
a production vector $y = (y_1, \ldots, y_J)$.

An allocation $(x, y)$ is individually feasible if $x_i \in X_i$ for all $i \in N$ and $y_j \in Y_j$ for all
$j = 1, \ldots, J$.

An allocation is weakly balanced if

$$x \le y + w \tag{7.7}$$

or specifically

$$\sum_{i=1}^{n} x_i \le \sum_{j=1}^{J} y_j + \sum_{i=1}^{n} w_i. \tag{7.8}$$

When the inequality holds with equality, the allocation is called a balanced or attainable
allocation.

An allocation $(x, y)$ is feasible if it is both individually feasible and (weakly) balanced.

Thus, an economic allocation is feasible if the total amount of each good consumed does
not exceed the total amount available from both the initial endowment and production.
Denote by $A = \{(x, y) \in X \times Y : x \le y + w\}$ the set of all feasible allocations.

Aggregation:

$x = \sum_{i=1}^{n} x_i$: aggregation of consumption;

$y = \sum_{j=1}^{J} y_j$: aggregation of production;

$w = \sum_{i=1}^{n} w_i$: aggregation of endowments.

Now we define the notion of competitive equilibrium. 

Definition 7.2.1 (Competitive Equilibrium, also called Walrasian Equilibrium)

Given a private ownership economy $e = (e_1, \ldots, e_n, \{Y_j\}, \{\theta_{ij}\})$, an allocation
$(x, y) \in X \times Y$ and a price vector $p \in \mathbb{R}^L_+$ constitute a competitive equilibrium if the
following conditions are satisfied:

(i) Utility maximization: $x_i \succsim_i x_i'$ for all $x_i' \in B_i(p)$ and $x_i \in B_i(p)$ for
$i = 1, \ldots, n$.

(ii) Profit maximization: $p y_j \ge p y_j'$ for all $y_j' \in Y_j$, for $j = 1, \ldots, J$.

(iii) Market clearing condition: $x \le w + y$.

Denote

$x_i(p) = \{x_i \in B_i(p) : x_i \succsim_i x_i' \text{ for all } x_i' \in B_i(p)\}$: the demand
correspondence of consumer $i$ under utility maximization; it is called the
demand function of consumer $i$ if it is a single-valued function.

$y_j(p) = \{y_j \in Y_j : p y_j \ge p y_j' \text{ for all } y_j' \in Y_j\}$: the supply correspondence of
firm $j$; it is called the supply function of firm $j$ if it is a single-valued
function.

$x(p) = \sum_{i=1}^{n} x_i(p)$: the aggregate demand correspondence.

$y(p) = \sum_{j=1}^{J} y_j(p)$: the aggregate supply correspondence.

$z(p) = x(p) - w - y(p)$: the aggregate excess demand correspondence.

An equivalent definition of competitive equilibrium is then that a price vector $p^* \in \mathbb{R}^L_+$
is a competitive equilibrium price if there exists $z \in z(p^*)$ such that $z \le 0$.
If $z(p)$ is single-valued, $p^*$ is a competitive equilibrium price whenever $z(p^*) \le 0$.



7.3 Some Examples of GE Models: Graphical Treatment

In most economies, there are three types of economic activities: production, consumption,
and exchange. Before formally stating the existence results on competitive equilibrium,
we will first give two simple examples of general equilibrium models: exchange economies
and a production economy with only one consumer and one firm. These examples
introduce some of the questions, concepts, and common techniques that will occupy us for the
rest of this part.

7.3.1 Pure Exchange Economies

A pure exchange economy is an economy in which there is no production. This is a
special case of the general economy. In this case, economic activities only consist of trading
and consumption.

The aggregate excess demand correspondence becomes $z(p) = x(p) - w$, so that we can
define the individual excess demand by $z_i(p) = x_i(p) - w_i$ for this special case.

The simplest exchange economy with the possibility of mutually beneficial exchange is the
two-good and two-consumer exchange economy. As it turns out, this case is amenable to
analysis by an extremely handy graphical device known as the Edgeworth Box.

• Edgeworth Box: 

Consider an exchange economy with two goods $(x^1, x^2)$ and two persons. The total
endowment is $w = w_1 + w_2$. For example, if $w_1 = (1, 2)$ and $w_2 = (3, 1)$, then the total
endowment is $w = (4, 3)$. Note that the point denoted by $w$ in the Edgeworth Box can
be used to represent the initial endowments of the two persons.

Figure 7.2: Edgeworth Box in which $w_1 = (1, 2)$ and $w_2 = (3, 1)$.

The advantage of the Edgeworth Box is that it gives all the possible (balanced) trading
points. That is,

$$x_1 + x_2 = w_1 + w_2 \tag{7.9}$$

for all points $x = (x_1, x_2)$ in the box, where $x_1 = (x_1^1, x_1^2)$ and $x_2 = (x_2^1, x_2^2)$. Thus, every
point in the Edgeworth Box stands for an attainable allocation, so that $x_1 + x_2 = w_1 + w_2$.

The shaded lens (portion) of the box in the above figure represents all the trading
points that make both persons better off. Any point outside the box is not feasible.

Which point in the Edgeworth Box can be a competitive equilibrium?

Figure 7.3: In the Edgeworth Box, the point CE is a competitive equilibrium.

In the box, one person's budget line is also the budget line of the other person. They
share the same budget line in the box.

Figure 7.4 below shows the market adjustment process to a competitive equilibrium.
Originally, at price $p$, both persons want more of good 2. This implies that the price of good
1, $p^1$, is too high, so that consumers do not consume the total amount of that good:
there is a surplus of $x^1$ and an excess demand for $x^2$, that is, $x_1^1 + x_2^1 < w_1^1 + w_2^1$
and $x_1^2 + x_2^2 > w_1^2 + w_2^2$. Thus, the market will adjust itself by decreasing $p^1$ to $p^{1\prime}$. As a
result, the budget line will become flatter and flatter until it reaches the equilibrium where
aggregate demand equals aggregate supply. In this interior equilibrium case, the two
indifference curves are tangent to each other at a point on the budget line, so that
the marginal rates of substitution for the two persons are the same and equal to the
price ratio.
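
The adjustment story can be mimicked numerically. The sketch below is only an illustration under assumptions that are not in the notes: both consumers have Cobb-Douglas preferences and spend half of their wealth on each good, endowments are $w_1 = (1, 2)$ and $w_2 = (3, 1)$ as in Figure 7.2, good 2 is the numeraire, and the price of good 1 moves in proportion to its excess demand. The function names and the step size are hypothetical.

```python
def excess_demand_good1(p1, p2=1.0):
    """Aggregate excess demand for good 1 (Cobb-Douglas, equal budget shares)."""
    endowments = [(1.0, 2.0), (3.0, 1.0)]  # (good 1, good 2) for each consumer
    demand = sum(0.5 * (p1 * e1 + p2 * e2) / p1 for e1, e2 in endowments)
    supply = sum(e1 for e1, _ in endowments)
    return demand - supply


def tatonnement(p1, step=0.1, tol=1e-10, max_iter=10_000):
    """Lower p1 under excess supply of good 1 and raise it under excess demand."""
    for _ in range(max_iter):
        z1 = excess_demand_good1(p1)
        if abs(z1) < tol:
            break
        p1 += step * z1
    return p1


# Start with good 1 too expensive, so that good 1 is in excess supply.
p1_star = tatonnement(p1=2.0)
```

For this particular economy the excess demand for good 1 is $1.5/p^1 - 2$, so the process settles at $p^1 = 0.75$ with $p^2 = 1$. Convergence of such a rule is a property of this example, not a general guarantee.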
Figure 7.4: This figure shows the market adjustment process.



What happens when indifference curves are linear? 




Figure 7.5: A competitive equilibrium may still exist even if two persons' indifference
curves do not intersect.

In this case, there is no tangency point as long as the slopes of the indifference curves
of the two persons are not the same. Even so, there still exists a competitive equilibrium,
although the marginal rates of substitution for the two persons are not the same.

Offer Curve: the locus of the optimal consumptions of the two goods as the price
varies. Note that it consists of the tangency points of the indifference curves and budget
lines as the price varies.

The offer curves of two persons are given in the following figure: 




Figure 7.6: The CE is characterized by the intersection of two persons' offer curves. 

The intersection of the two consumers' offer curves can be used to check whether there
is a competitive equilibrium. From the above diagram, one can see that one intersection
of the two consumers' offer curves always occurs at the endowment $w$. If there is another
intersection of the two offer curves besides the one at the endowment $w$, that
intersection point must be a competitive equilibrium.

To enhance your understanding of competitive equilibrium, try to draw the
competitive equilibrium in the following situations:

1. There are many equilibrium prices.

2. One person's preference is such that two goods are perfect substitutes (i.e.,
indifference curves are linear).

3. Preferences of one person are of the Leontief type (perfect complements).

4. One person's preferences are non-convex.

5. One person's preferences are "thick".

6. One person's preferences are convex, but have a satiation point.

Note that a preference relation $\succsim_i$ is convex if $x \succ_i x'$ implies $tx + (1 - t)x' \succ_i x'$ for
all $t \in (0, 1)$ and all $x, x' \in X_i$. A preference relation $\succsim_i$ has a satiation point $x$ if $x \succsim_i x'$
for all $x' \in X_i$.

Cases in which there may be no Walrasian Equilibria: 

Case 1. Indifference curves (IC) are not convex. If the two offer curves do not
intersect except at the endowment point, then there may not exist a competitive
equilibrium. This may be true when preferences are not convex.



Figure 7.7: A CE may not exist if indifference curves are not convex. In that case, two 
persons' offer curves are not tangent. 

Case 2. The initial endowment may not be an interior point of the consumption
space. Consider an exchange economy in which one person's indifference curves put no
value on one commodity, say, when Person 2's indifference curves are vertical lines so
that $u_2(x_2^1, x_2^2) = x_2^1$. Person 1's utility function is a regular one, say, the
quasi-linear utility function $u_1 = \sqrt{x_1^1} + x_1^2$. The initial endowments are given by
$w_1 = (0, 1)$ and $w_2 = (1, 0)$. There is no competitive equilibrium. Why? There are two
cases to be considered.

Case (a). $p^1 = 0$ or $p^2 = 0$. Person 1 would demand an infinite amount of the good whose
price is zero. Thus, there is no competitive equilibrium in this case.

Case (b). $p^1 > 0$ and $p^2 > 0$. Person 2's demand for good 1 is $x_2^1 = 1$, and person 1
would demand a positive quantity of good 1 because of the quasi-linear utility function.
As such, the aggregate demand for good 1 is greater than one, that is, $x_1^1(p) + x_2^1(p) > 1 = w^1$,
the total endowment of good 1, which violates the feasibility condition. Thus, no CE exists.
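
Case (b) can be checked numerically. The following sketch assumes the utility functions and endowments stated above, derives person 1's demand for good 1 from the first-order condition (with a corner solution when the interior demand is not affordable), and scans a grid of strictly positive prices. The function names and the price grid are hypothetical.

```python
def demand_good1_person1(p1, p2):
    """Person 1 maximizes sqrt(x1) + x2 subject to p1*x1 + p2*x2 <= p2 (w_1 = (0, 1))."""
    interior = (p2 / (2.0 * p1)) ** 2  # from the FOC 1 / (2*sqrt(x1)) = p1 / p2
    corner = p2 / p1                   # all income spent on good 1
    return min(interior, corner)


def demand_good1_person2(p1, p2):
    """Person 2 only values good 1 and owns w_2 = (1, 0), so income p1 buys 1 unit."""
    return (p1 * 1.0) / p1


def excess_demand_good1(p1, p2):
    total_endowment_good1 = 1.0
    return (demand_good1_person1(p1, p2)
            + demand_good1_person2(p1, p2)
            - total_endowment_good1)


# Good 2 is the numeraire; scan relative prices from 0.001 to 1000.
grid = [10.0 ** k for k in range(-3, 4)]
all_positive = all(excess_demand_good1(p1, 1.0) > 0 for p1 in grid)
```

On every grid point the excess demand for good 1 is strictly positive, in line with the argument that no strictly positive price vector clears the market for good 1.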
Figure 7.8: A CE may not exist if an endowment is on the boundary.

7.3.2 The One-Consumer and One Producer Economy 

Now we introduce the possibility of production. We do so in the simplest possible setting,
in which there are only two price-taking economic agents.

Two agents: one producer so that $J = 1$ and one consumer so that $n = 1$.

Two goods: labor (leisure) and the consumption good produced by the firm.

$w = (L, 0)$: the endowment.

$L$: the total units of leisure time.

$f(z): \mathbb{R}_+ \to \mathbb{R}_+$: the production function, which is strictly increasing, concave, and
differentiable, where $z$ is the labor input. To have an interior solution, we
assume $f$ satisfies the Inada condition $f'(0) = +\infty$ and $\lim_{z \to 0} f'(z) z = 0$.

$(p, \omega)$: the price vector of the consumption good and labor.

$\theta = 1$: single-person economy.

$u(x^1, x^2): \mathbb{R}^2_+ \to \mathbb{R}$: the utility function, which is strictly quasi-concave,
increasing, and differentiable. To have an interior solution, we assume $u$
satisfies the Inada condition $\frac{\partial u}{\partial x^i}(0) = +\infty$ and $\lim_{x^i \to 0} \frac{\partial u}{\partial x^i} x^i = 0$.

The firm's problem is to choose the labor input $z$ so as to solve

$$\max_{z \ge 0} \; p f(z) - \omega z \tag{7.10}$$

FOC:

$$p f'(z) = \omega \;\Rightarrow\; f'(z) = \omega / p \qquad (MRTS_{z,q} = \text{price ratio})$$

which means that the marginal rate of technical substitution of labor for the consumption
good $q$ equals the price ratio of the labor input and the consumption good output.

Let

$q(p, \omega)$ = the profit-maximizing output of the consumption good.
$z(p, \omega)$ = the profit-maximizing labor input.
$\pi(p, \omega)$ = the profit function (maximized profit).

Figure 7.9: Figures for the producer's problem, the consumer problem and the CE.

The consumer's problem is to choose the leisure time and the consumption of the
good so as to solve

$$\max_{x^1, x^2} \; u(x^1, x^2) \quad \text{s.t.} \quad p x^2 \le \omega (L - x^1) + \pi(p, \omega)$$

where $x^1$ is leisure and $x^2$ is the consumption of the good.

FOC:

$$\frac{\partial u / \partial x^1}{\partial u / \partial x^2} = \frac{\omega}{p} \tag{7.11}$$

which means that the marginal rate of substitution of leisure for the consumption
good $q$ equals the price ratio of leisure and the consumption good, i.e.,
$MRS_{x^1 x^2} = \omega / p$.

By (7.10) and (7.11),

$$MRS_{x^1 x^2} = \frac{\omega}{p} = MRTS_{z,q} \tag{7.12}$$

A competitive equilibrium for this economy involves a price vector $(p^*, \omega^*)$ at which

$$x^2(p^*, \omega^*) = q(p^*, \omega^*); \qquad x^1(p^*, \omega^*) + z(p^*, \omega^*) = L.$$

That is, the aggregate demand for the two goods equals the aggregate supply for the
two goods. Figure 7.9 shows the problems of the firm and the consumer, and the competitive
equilibrium, respectively.
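
To see the equilibrium conditions at work, here is a minimal worked sketch with specific functional forms that are assumptions of this example only: $f(z) = \sqrt{z}$, $u(x^1, x^2) = \ln x^1 + \ln x^2$, a time endowment of 12 units, and $p$ normalized to 1. These forms are chosen because they give closed-form demands and an interior equilibrium; the names `firm`, `consumer`, and `L_BAR` are hypothetical.

```python
import math

L_BAR = 12.0  # hypothetical total time endowment


def firm(p, omega):
    """Profit-maximizing labor input, output, and profit for f(z) = sqrt(z)."""
    z = (p / (2.0 * omega)) ** 2  # from the FOC p * f'(z) = omega
    q = math.sqrt(z)
    profit = p * q - omega * z
    return z, q, profit


def consumer(p, omega, profit):
    """Leisure x1 and consumption x2 for u = ln(x1) + ln(x2)."""
    income = omega * L_BAR + profit  # value of the time endowment plus the dividend
    return income / (2.0 * omega), income / (2.0 * p)


# Equilibrium wage for these functional forms with p = 1, obtained by solving
# x2(p, omega) = q(p, omega): omega* = sqrt(3 / (4 * L_BAR)).
p_star = 1.0
omega_star = math.sqrt(3.0 / (4.0 * L_BAR))

z_star, q_star, pi_star = firm(p_star, omega_star)
x1_star, x2_star = consumer(p_star, omega_star, pi_star)
```

At these prices the firm hires one third of the time endowment, both markets clear, and the consumer's marginal rate of substitution $x^2/x^1$ equals $\omega^*/p^* = f'(z^*)$, as in (7.12).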



7.4 The Existence of Competitive Equilibrium 

The proof of the existence of a competitive equilibrium is generally considered one of 
the most important and robust results of economic theory. There are many different 
ways of establishing the existence of competitive equilibria, including the "excess demand 
approach" by showing that there is a price at which excess demand can be non-positive. 

In this section we will examine the existence of competitive equilibrium for three
cases: (1) the single-valued aggregate excess demand function; (2) the aggregate excess
demand correspondence; (3) a general class of private ownership production economies.
The first two cases are based on excess demand instead of the underlying preference
orderings and consumption and production sets. There are many ways to prove the existence
of general equilibrium. For instance, one can use the Brouwer fixed point theorem
approach, the KKM lemma approach, and the abstract economy approach to show the existence of
competitive equilibrium for these three cases.

7.4.1 The Existence of CE for Aggregate Excess Demand Functions

The simplest case for the existence of a competitive equilibrium is the one in which the
aggregate excess demand correspondence is a single-valued function. Note that when
preference orderings and production sets are both strictly convex, we obtain excess
demand functions rather than correspondences.

A very important property of the excess demand function $z(p)$ is Walras' law, which can
take one of the following three forms:

(1) the strong Walras' law given by

$p \cdot z(p) = 0$ for all $p \in \mathbb{R}^L_+$;

(2) the weak Walras' law given by

$p \cdot z(p) \le 0$ for all $p \in \mathbb{R}^L_+$;

(3) the interior Walras' law given by

$p \cdot z(p) = 0$ for all $p \in \mathbb{R}^L_{++}$.

Another important property of the excess demand function is homogeneity of $z(p)$: it is
homogeneous of degree zero in prices, i.e., $z(\lambda p) = z(p)$ for any $\lambda > 0$. From this
property, we can normalize prices.

Because of homogeneity, for example, we can normalize a price vector as follows:

(1) $p'^{l} = p^l / p^1$, $l = 1, 2, \ldots, L$;
(2) $p'^{l} = p^l / \sum_{k=1}^{L} p^k$.

Thus, without loss of generality, we can restrict our attention to the unit simplex:

$$S^{L-1} = \Big\{ p \in \mathbb{R}^L_+ : \sum_{l=1}^{L} p^l = 1 \Big\}. \tag{7.13}$$
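
The two normalizations, together with homogeneity and Walras' law, can be checked on a concrete excess demand function. The sketch below assumes a hypothetical two-good Cobb-Douglas exchange economy (equal budget shares, endowments $w_1 = (1, 2)$ and $w_2 = (3, 1)$); the economy and the function names are illustrative assumptions, not part of the notes.

```python
def excess_demand(p):
    """Excess demand of a hypothetical two-good Cobb-Douglas exchange economy."""
    endowments = [(1.0, 2.0), (3.0, 1.0)]  # (good 1, good 2) for each consumer
    p1, p2 = p
    wealth = [p1 * e1 + p2 * e2 for e1, e2 in endowments]
    z1 = sum(0.5 * m / p1 for m in wealth) - sum(e1 for e1, _ in endowments)
    z2 = sum(0.5 * m / p2 for m in wealth) - sum(e2 for _, e2 in endowments)
    return (z1, z2)


def normalize_numeraire(p):
    """Normalization (1): divide every price by the price of good 1."""
    return tuple(pl / p[0] for pl in p)


def normalize_simplex(p):
    """Normalization (2): divide by the sum of prices, so the result lies in S^{L-1}."""
    total = sum(p)
    return tuple(pl / total for pl in p)


p = (3.0, 5.0)
z_raw = excess_demand(p)
z_num = excess_demand(normalize_numeraire(p))
z_simplex = excess_demand(normalize_simplex(p))
walras = sum(pl * zl for pl, zl in zip(p, z_raw))  # p . z(p)
```

All three price vectors give the same excess demand, and $p \cdot z(p)$ is zero up to rounding, so nothing is lost by working on the simplex in this example.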
Then, we have the following theorem on the existence of competitive equilibrium.

Theorem 7.4.1 (The Existence Theorem I) For a private ownership economy
$e = (\{X_i, w_i, \succsim_i\}, \{Y_j\}, \{\theta_{ij}\})$, if $z(p)$ is a homogeneous of degree zero and continuous
function and satisfies the strong Walras' Law, then there exists a competitive equilibrium, that
is, there is $p^* \in \mathbb{R}^L_+$ such that

$$z(p^*) \le 0. \tag{7.14}$$
Proof: Define a continuous function $g : S^{L-1} \to S^{L-1}$ by

$$g^l(p) = \frac{p^l + \max\{0, z^l(p)\}}{1 + \sum_{k=1}^{L} \max\{0, z^k(p)\}} \tag{7.15}$$

for $l = 1, 2, \ldots, L$.

First note that $g$ is a continuous function since $\max\{f(x), h(x)\}$ is continuous when
$f(x)$ and $h(x)$ are continuous.

By Brouwer's fixed point theorem, there exists a price vector $p^*$ such that $g(p^*) = p^*$,
i.e.,

$$p^{*l} = \frac{p^{*l} + \max\{0, z^l(p^*)\}}{1 + \sum_{k=1}^{L} \max\{0, z^k(p^*)\}}, \qquad l = 1, 2, \ldots, L. \tag{7.16}$$

We want to show $p^*$ is in fact a competitive equilibrium price vector.
Cross multiplying $1 + \sum_{k=1}^{L} \max\{0, z^k(p^*)\}$ on both sides of (7.16), we have

$$p^{*l} \sum_{k=1}^{L} \max\{0, z^k(p^*)\} = \max\{0, z^l(p^*)\}. \tag{7.17}$$

Then, multiplying the above equation by $z^l(p^*)$ and summing over $l$, we have

$$\Big[ \sum_{l=1}^{L} p^{*l} z^l(p^*) \Big] \Big[ \sum_{k=1}^{L} \max\{0, z^k(p^*)\} \Big] = \sum_{l=1}^{L} z^l(p^*) \max\{0, z^l(p^*)\}. \tag{7.18}$$

By the strong Walras' Law, the first factor on the left-hand side is zero, so we have

$$\sum_{l=1}^{L} z^l(p^*) \max\{0, z^l(p^*)\} = 0. \tag{7.19}$$

Each term of the summation is either zero or $(z^l(p^*))^2 > 0$. Thus, for
the summation to be zero, each term must be zero. That is, $z^l(p^*) \le 0$ for
$l = 1, \ldots, L$. The proof is completed.
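
The map $g$ in (7.15) is easy to evaluate. The sketch below does so for a hypothetical two-good Cobb-Douglas exchange economy (equal budget shares, endowments $w_1 = (1, 2)$ and $w_2 = (3, 1)$), which is an assumption of this illustration. Note that Cobb-Douglas excess demand is not defined on the boundary of the simplex, so the example only illustrates how $g$ acts at interior prices; it does not satisfy the hypotheses of Theorem 7.4.1.

```python
def excess_demand(p):
    """Excess demand of a hypothetical two-good Cobb-Douglas exchange economy."""
    endowments = [(1.0, 2.0), (3.0, 1.0)]
    total_wealth = sum(p[0] * e1 + p[1] * e2 for e1, e2 in endowments)
    totals = [sum(e[l] for e in endowments) for l in range(2)]
    return tuple(0.5 * total_wealth / p[l] - totals[l] for l in range(2))


def g(p):
    """The price-adjustment map (7.15): raise prices of goods in excess demand."""
    gains = [max(0.0, zl) for zl in excess_demand(p)]
    denom = 1.0 + sum(gains)
    return tuple((pl + gl) / denom for pl, gl in zip(p, gains))


p_star = (3.0 / 7.0, 4.0 / 7.0)  # equilibrium price of this economy on the simplex
p_off = (0.5, 0.5)               # a non-equilibrium price

fixed_point_image = g(p_star)    # should coincide with p_star
adjusted = g(p_off)              # shifts weight toward good 2, which is in excess demand
```

At `p_off` good 2 is in excess demand, and $g$ raises its normalized price while keeping the image on the simplex; at `p_star` excess demand vanishes and $g$ leaves the price unchanged.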

Remark 7.4.1 Do not confuse competitive equilibrium of an aggregate excess demand
function with the strong Walras' Law. Even though Walras' Law holds, we may not have
$z(p) \le 0$ for all $p$. Also, if $z(p^*) \le 0$ for some $p^*$, i.e., $p^*$ is a competitive equilibrium price
vector, the strong Walras' Law may not hold unless some type of monotonicity is
imposed, such as local non-satiation.

Fact 1: (free goods). Under the strong Walras' Law, if $p^*$ is a competitive
equilibrium price vector and $z^l(p^*) < 0$, then $p^{*l} = 0$.

Proof. Suppose not. Then $p^{*l} > 0$. Thus, $p^{*l} z^l(p^*) < 0$, and since $z^k(p^*) \le 0$ for all $k$,
we have $p^* z(p^*) < 0$, contradicting the strong Walras' Law.

Desirable goods: if $p^l = 0$, then $z^l(p) > 0$.

Fact 2: (Equality of demand and supply). If all goods are desirable and $p^*$ is a
competitive equilibrium price vector, then $z(p^*) = 0$.

Proof. Suppose not. We have $z^l(p^*) < 0$ for some $l$. Then, by Fact 1, we have $p^{*l} = 0$.
Since good $l$ is desirable, we must have $z^l(p^*) > 0$, a contradiction.

Remark 7.4.2 By the strong Walras' Law, if $p > 0$ and $(L - 1)$ markets are in
equilibrium, the $L$-th market is also in equilibrium. Thus, because of the strong
Walras' Law, to verify that a price vector $p > 0$ clears all markets, it suffices to check that
it clears all markets but one.

The above result requires that the aggregate excess demand function be continuous. By
using the KKM lemma, we can prove the existence of competitive equilibrium by only
assuming that the aggregate excess demand function is lower semi-continuous.

Theorem 7.4.2 (The Existence Theorem I') For a private ownership economy
$e = (\{X_i, w_i, \succsim_i\}, \{Y_j\}, \{\theta_{ij}\})$, if the aggregate excess demand function $z(p)$ is a lower
semi-continuous function and satisfies the strong or weak Walras' Law, then there exists a
competitive equilibrium, that is, there is $p^* \in S^{L-1}$ such that

$$z(p^*) \le 0. \tag{7.20}$$

Proof. Define a correspondence $F : S^{L-1} \to 2^{S^{L-1}}$ by

$F(q) = \{p \in S^{L-1} : q z(p) \le 0\}$ for all $q \in S^{L-1}$.

First note that $F(q)$ is nonempty for each $q \in S^{L-1}$ since $q \in F(q)$ by the strong or weak
Walras' Law. Since $q \ge 0$ and $z(\cdot)$ is lower semi-continuous, the function defined by
$\phi(q, p) = q z(p) = \sum_{l=1}^{L} q^l z^l(p)$ is lower semi-continuous in $p$. Hence, the set $F(q)$ is closed
for all $q \in S^{L-1}$. We now prove $F$ is FS-convex. Suppose, by way of contradiction, that
there are some $q_1, \ldots, q_m \in S^{L-1}$ and some convex combination $q_\lambda = \sum_{t=1}^{m} \lambda_t q_t$ such that
$q_\lambda \notin \bigcup_{t=1}^{m} F(q_t)$. Then, $q_\lambda \notin F(q_t)$ and therefore $q_t z(q_\lambda) > 0$ for all $t = 1, \ldots, m$. Thus,
$\sum_{t=1}^{m} \lambda_t q_t z(q_\lambda) = q_\lambda z(q_\lambda) > 0$, which contradicts the fact that $z$ satisfies the strong or weak
Walras' Law. So $F$ must be FS-convex. Therefore, by the KKM lemma, we have

$$\bigcap_{q \in S^{L-1}} F(q) \ne \emptyset.$$

Then there exists a $p^* \in S^{L-1}$ such that $p^* \in \bigcap_{q \in S^{L-1}} F(q)$, i.e., $p^* \in F(q)$ for all $q \in S^{L-1}$.
Thus, $q z(p^*) \le 0$ for all $q \in S^{L-1}$. Now let $q^1 = (1, 0, \ldots, 0)$, $q^2 = (0, 1, 0, \ldots, 0)$, ..., and
$q^L = (0, \ldots, 0, 1)$. Then $q^t \in S^{L-1}$ and thus $q^t z(p^*) = z^t(p^*) \le 0$ for all $t = 1, \ldots, L$. Thus
we have $z(p^*) \le 0$, which means $p^*$ is a competitive equilibrium price vector. The proof
is completed.

The above two existence theorems only give sufficient conditions. Recently, Tian
(2013) provides a complete solution to the existence of competitive equilibrium in
economies with general excess demand functions, in which commodities may be indivisible
and excess demand functions may be discontinuous or have no structure except
Walras' law.

Tian introduces a very weak condition, called recursive transfer lower semi-continuity,
which is weaker than transfer lower semi-continuity, which is in turn weaker than lower
semi-continuity. It is shown that the condition, together with Walras' law, guarantees the
existence of price equilibrium in economies with excess demand functions. The condition
is also necessary, and thus the result generalizes all the existing results on the existence
of price equilibrium in economies where excess demand is a function. For convenience of
discussion, we introduce the following term.

We say that price system $p$ upsets price system $q$ if $q$'s excess demand is not affordable
at price $p$, i.e., $p \cdot z(q) > 0$.

Definition 7.4.1 An excess demand function $z(\cdot) : S^{L-1} \to \mathbb{R}^L$ is transfer lower
semi-continuous if for all $q, p \in S^{L-1}$, $p \cdot z(q) > 0$ implies that there exists some point $p' \in S^{L-1}$
and some neighborhood $N(q)$ of $q$ such that $p' \cdot z(q') > 0$ for all $q' \in N(q)$.

Remark 7.4.3 The transfer lower semi-continuity of $z(\cdot)$ means that, if the aggregate
excess demand $z(q)$ at price vector $q$ is not affordable at price system $p$, then there exists
a price system $p'$ such that $z(q')$ is also not affordable at price system $p'$, provided $q'$ is
sufficiently close to $q$. Note that, since $p \ge 0$, this condition is satisfied if $z(\cdot)$ is lower
semi-continuous by letting $p' = p$.

Definition 7.4.2 (Recursive Upset Pricing) Let $z(\cdot) : S^{L-1} \to \mathbb{R}^L$ be an excess demand
function. We say that a non-equilibrium price system $p^0 \in S^{L-1}$ is recursively upset by
$p \in S^{L-1}$ if there exists a finite set of price systems $\{p^1, p^2, \ldots, p\}$ such that $p^1 \cdot z(p^0) > 0$,
$p^2 \cdot z(p^1) > 0$, ..., $p \cdot z(p^{m-1}) > 0$.

In words, a non-equilibrium price system $p^0$ is recursively upset by $p$ means that
there exist finitely many upsetting price systems $p^1, p^2, \ldots, p^m$ with $p^m = p$ such that $p^0$'s excess
demand is not affordable at $p^1$, $p^1$'s excess demand is not affordable at $p^2$, ..., and $p^{m-1}$'s
excess demand is not affordable at $p^m$. When the strong form of Walras' law holds, this
implies that $p^0$ is upset by $p^1$, $p^1$ is upset by $p^2$, ..., and $p^{m-1}$ is upset by $p$.

For convenience, we say $p^0$ is directly upset by $p$ when $m = 1$, and indirectly upset
by $p$ when $m > 1$. Recursive upsetting says that a non-equilibrium price system $p^0$ can be
directly or indirectly upset by a price system $p$ through sequential upsetting price systems
$\{p^1, p^2, \ldots, p^{m-1}\}$ in a recursive way such that $p^0$ is upset by $p^1$, $p^1$ is upset by $p^2$, ..., and
$p^{m-1}$ is upset by $p$.
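
The upsetting relation and a recursive upsetting chain are simple to test numerically. The sketch below uses a hypothetical linear excess demand function on the one-dimensional simplex, $z(p) = (p^2, -p^1)$, which satisfies the strong Walras' law; this function and the helper names are assumptions made only for illustration.

```python
def dot(a, b):
    """Inner product of two vectors of equal length."""
    return sum(ai * bi for ai, bi in zip(a, b))


def upsets(p, q, z):
    """True if price system p upsets q, i.e. p . z(q) > 0."""
    return dot(p, z(q)) > 0


def recursively_upsets(chain, z):
    """True if chain = [p0, p1, ..., pm] satisfies p^k . z(p^(k-1)) > 0 for k = 1..m,
    so that p0 is recursively upset by the last element pm."""
    return len(chain) >= 2 and all(
        upsets(chain[k], chain[k - 1], z) for k in range(1, len(chain))
    )


def z(p):
    """Hypothetical excess demand with p . z(p) = 0 for every p."""
    return (p[1], -p[0])


direct = upsets((1.0, 0.0), (0.5, 0.5), z)        # (1, 0) upsets (0.5, 0.5)
not_upset = upsets((0.0, 1.0), (0.5, 0.5), z)     # (0, 1) does not
chain_ok = recursively_upsets([(0.2, 0.8), (0.5, 0.5), (1.0, 0.0)], z)
```

In this example $p^* = (1, 0)$ is an equilibrium price system ($z(p^*) = (0, -1) \le 0$), and no price system on the simplex upsets it, whereas $(0.2, 0.8)$ is recursively upset by $(1, 0)$ through $(0.5, 0.5)$.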

Definition 7.4.3 (Recursive Transfer Lower Semi-Continuity) An excess demand
function $z(\cdot) : S^{L-1} \to \mathbb{R}^L$ is said to be recursively transfer lower semi-continuous on $S^{L-1}$ if,
whenever $q \in S^{L-1}$ is not a competitive equilibrium price system, there exists some price
system $p^0 \in S^{L-1}$ (possibly $p^0 = q$) and a neighborhood $V_q$ such that $p \cdot z(V_q) > 0$ for any
$p$ that recursively upsets $p^0$, where $p \cdot z(V_q) > 0$ means $p \cdot z(q') > 0$ for all $q' \in V_q$.

In the definition of recursive transfer lower semi-continuity, $q$ is transferred to $p^0$, which
could be any point in $S^{L-1}$. Roughly speaking, recursive transfer lower semi-continuity of
$z(\cdot)$ means that, whenever $q$ is not an equilibrium price system, there exists another
non-equilibrium price system $p^0$ such that all excess demands in some neighborhood of $q$
are not affordable at any price system $p$ that recursively upsets $p^0$. This implies that, if an
excess demand function $z(\cdot) : S^{L-1} \to \mathbb{R}^L$ is not recursively transfer lower semi-continuous,
then there is some non-equilibrium price system $q$ such that for every other price system $p^0$
and every neighborhood of $q$, the excess demand of some price system $q'$ in the neighborhood
becomes affordable at a price system $p$ that recursively upsets $p^0$.

Remark 7.4.4 Recursive transfer lower semi-continuity is weaker than lower semi-continuity.
Indeed, when $z(\cdot)$ is lower semi-continuous, $p \cdot z(\cdot)$ is also lower semi-continuous for any
nonnegative vector $p$, and thus we have $p \cdot z(q') > 0$ for all $q' \in V_q$ and $p \in S^{L-1}$. Then,
for any finite price vectors $\{p^1, p^2, \ldots, p^m\}$, we of course have $p^k \cdot z(q') > 0$ for all $q' \in V_q$,
$k = 1, \ldots, m$, which means $z(\cdot)$ is recursively transfer lower semi-continuous.

We then have the following theorem of competitive equilibrium in economies that have
single-valued excess demand functions.

Theorem 7.4.3 (The Existence Theorem I'') Suppose an excess demand function
$z(\cdot) : S^{L-1} \to \mathbb{R}^L$ satisfies either the strong or weak Walras' law. Then there exists a
competitive price equilibrium $p^* \in S^{L-1}$ if and only if $z(\cdot)$ is recursively transfer lower
semi-continuous on $S^{L-1}$.

Proof. Sufficiency ($\Leftarrow$). Suppose, by way of contradiction, that there is no price
equilibrium. Then, by recursive transfer lower semi-continuity of $z(\cdot)$, for each $q \in S^{L-1}$,
there exists $p^0$ and a neighborhood $V_q$ such that $p \cdot z(V_q) > 0$ whenever $p^0 \in S^{L-1}$ is
recursively upset by $p$, i.e., for any sequence of recursive price systems $\{p^1, \ldots, p^{m-1}, p\}$
with $p \cdot z(p^{m-1}) > 0$, $p^{m-1} \cdot z(p^{m-2}) > 0$, ..., $p^1 \cdot z(p^0) > 0$ for $m \ge 1$, we have $p \cdot z(V_q) > 0$.
Since there is no price equilibrium by the contrapositive hypothesis, $p^0$ is not a price
equilibrium and thus, by recursive transfer lower semi-continuity, such a sequence of
recursive price systems $\{p^1, \ldots, p^{m-1}, p\}$ exists for some $m \ge 1$.

Since $S^{L-1}$ is compact and $S^{L-1} \subseteq \bigcup_{q \in S^{L-1}} V_q$, there is a finite set $\{q^1, \ldots, q^T\}$ such
that $S^{L-1} \subseteq \bigcup_{i=1}^{T} V_{q^i}$. For each of such $q^i$, the corresponding initial price system is
denoted by $p^{0i}$, so that $p^i \cdot z(V_{q^i}) > 0$ whenever $p^{0i}$ is recursively upset by $p^i$.

Since there is no price equilibrium, for each of such $p^{0i}$, there exists $p^i$ such that
$p^i \cdot z(p^{0i}) > 0$, and then, by 1-recursive transfer lower semi-continuity, we have $p^i \cdot z(V_{q^i}) > 0$.
Now consider the set of price systems $\{p^1, \ldots, p^T\}$. Then, $p^i \notin V_{q^i}$; otherwise, by
$p^i \cdot z(V_{q^i}) > 0$, we will have $p^i \cdot z(p^i) > 0$, contradicting Walras' law. So we must have
$p^1 \notin V_{q^1}$.

Without loss of generality, we suppose $p^1 \in V_{q^2}$. Since $p^2 \cdot z(p^1) > 0$ by noting that
$p^1 \in V_{q^2}$ and $p^1 \cdot z(p^{01}) > 0$, then, by 2-recursive transfer lower semi-continuity, we have
$p^2 \cdot z(V_{q^1}) > 0$. Also, $p^2 \cdot z(V_{q^2}) > 0$. Thus $p^2 \cdot z(V_{q^1} \cup V_{q^2}) > 0$, and consequently
$p^2 \notin V_{q^1} \cup V_{q^2}$.

Again, without loss of generality, we suppose $p^2 \in V_{q^3}$. Since $p^3 \cdot z(p^2) > 0$ by
noting that $p^2 \in V_{q^3}$, $p^2 \cdot z(p^1) > 0$, and $p^1 \cdot z(p^{01}) > 0$, by 3-recursive transfer lower
semi-continuity, we have $p^3 \cdot z(V_{q^1}) > 0$. Also, since $p^3 \cdot z(p^2) > 0$ and $p^2 \cdot z(p^{02}) > 0$,
by 2-recursive transfer lower semi-continuity, we have $p^3 \cdot z(V_{q^2}) > 0$. Thus, we have
$p^3 \cdot z(V_{q^1} \cup V_{q^2} \cup V_{q^3}) > 0$, and consequently $p^3 \notin V_{q^1} \cup V_{q^2} \cup V_{q^3}$.

With this process going on, we can show that $p^k \notin V_{q^1} \cup V_{q^2} \cup \cdots \cup V_{q^k}$, i.e., $p^k$ is
not in the union of $V_{q^1}, V_{q^2}, \ldots, V_{q^k}$ for $k = 1, 2, \ldots, T$. In particular, for $k = T$, we have
$p^T \notin V_{q^1} \cup V_{q^2} \cup \cdots \cup V_{q^T}$ and so $p^T \notin S^{L-1} \subseteq V_{q^1} \cup V_{q^2} \cup \cdots \cup V_{q^T}$, a contradiction.

Thus, there exists $p^* \in S^{L-1}$ such that $p \cdot z(p^*) \le 0$ for all $p \in S^{L-1}$. Letting
$p^1 = (1, 0, \ldots, 0)$, $p^2 = (0, 1, 0, \ldots, 0)$, ..., and $p^L = (0, 0, \ldots, 0, 1)$, we have $z^l(p^*) \le 0$ for
$l = 1, \ldots, L$ and thus $p^*$ is a price equilibrium.

Necessity ($\Rightarrow$). Suppose $p^*$ is a competitive price equilibrium and $p \cdot z(q) > 0$ for
$q, p \in S^{L-1}$. Let $p^0 = p^*$ and $N(q)$ be a neighborhood of $q$. Since $p \cdot z(p^*) \le 0$ for all
$p \in S^{L-1}$, it is impossible to find any sequence of finite price vectors $\{p^1, p^2, \ldots, p^m\}$ such
that $p^1 \cdot z(p^0) > 0$, $p^2 \cdot z(p^1) > 0$, ..., $p^m \cdot z(p^{m-1}) > 0$. Hence, the recursive transfer lower
semi-continuity holds trivially. ■

This theorem is useful for checking the nonexistence of competitive equilibrium.

The method of proof employed to obtain Theorem 7.4.3 is new. While there are
different ways of establishing the existence of competitive equilibrium, all the existing
proofs essentially use fixed-point-theorem-related approaches. Moreover, a remarkable
advantage of the above proof is that it is simple and elementary, without using advanced
mathematics.

The above three existence theorems assume that the excess demand function is well
defined for all prices in the closed unit simplex $S^{L-1}$, including zero prices. However,
when preferences are strictly monotone, excess demand functions are not well defined on
the boundary of $S^{L-1}$, so the above existence theorems cannot be applied. We then
need an existence theorem for strictly positive prices, which is given below.

Theorem 7.4.4 (The Existence Theorem I''') For a private ownership economy
$e = (\{X_i, w_i, \succsim_i\}, \{Y_j\}, \{\theta_{ij}\})$, suppose the aggregate excess demand function $z(p)$ is defined
for all strictly positive price vectors $p \in \mathbb{R}^L_{++}$, and satisfies the following conditions:

(i) $z(\cdot)$ is continuous;

(ii) $z(\cdot)$ is homogeneous of degree zero;

(iii) $p \cdot z(p) = 0$ for all $p \in \mathbb{R}^L_{++}$ (interior Walras' Law);

(iv) There is an $s > 0$ such that $z^l(p) > -s$ for every commodity $l$ and all
$p \in \mathbb{R}^L_{++}$;

(v) If $p_k \to p$, where $p \ne 0$ and $p^l = 0$ for some $l$, then
$\max\{z^1(p_k), \ldots, z^L(p_k)\} \to \infty$.

Then there is $p^* \in \mathbb{R}^L_{++}$ such that

$$z(p^*) = 0 \tag{7.21}$$

and thus $p^*$ is a competitive equilibrium.

Proof. Because of homogeneity of degree zero we can restrict our search for an equilibrium
to the unit simplex $S$. Denote its interior by $\mathrm{int}\,S$. We want to construct
a correspondence $F$ from $S$ to $S$ such that any fixed point $p^*$ of $F$ is a competitive
equilibrium, i.e., $p^* \in F(p^*)$ implies $z(p^*) = 0$.

Define a correspondence $F : S \to 2^S$ by

$$F(p) = \begin{cases} \{q \in S : z(p) \cdot q \ge z(p) \cdot q' \text{ for all } q' \in S\} & \text{if } p \in \mathrm{int}\,S \\ \{q \in S : p \cdot q = 0\} & \text{if } p \text{ is on the boundary} \end{cases}$$

Note that, for $p \in \mathrm{int}\,S$, $F(\cdot)$ means that, given the current "proposal" $p \in \mathrm{int}\,S$,
the "counterproposal" assigned by the correspondence $F(\cdot)$ is any price vector $q$ that
maximizes the value of the excess demand vector among the permissible price vectors
in $S$. Here $F(\cdot)$ can be thought of as a rule that adjusts current prices in a direction that
eliminates any excess demand: the correspondence $F(\cdot)$ assigns the highest prices to the
commodities that are most in excess demand. In particular, we have

$$F(p) = \{q \in S : q^l = 0 \text{ if } z^l(p) < \max\{z^1(p), \ldots, z^L(p)\}\}.$$

Observe that if $z(p) \ne 0$ for $p \in \mathrm{int}\,S$, then because of the interior Walras' law we
have $z^l(p) < 0$ for some $l$ and $z^{l'}(p) > 0$ for some $l' \ne l$. Thus, for such a $p$, any $q \in F(p)$
has $q^l = 0$ for some $l$ (to maximize the value of the excess demand vector). Therefore, if
$z(p) \ne 0$ then $F(p) \subset \text{Boundary } S = S \setminus \mathrm{int}\,S$. In contrast, if $z(p) = 0$ then $F(p) = S$.
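For an interior price vector, the description above says that $F(p)$ is the face of the simplex spanned by the commodities with maximal excess demand. The following is a minimal Python sketch of that observation, not part of the original proof. It assumes the excess demand vector $z(p)$ has already been computed and is passed in as a list; the function names `max_excess_demand_face` and `value`, the tolerance, and the numerical vector are all hypothetical illustrations.

```python
def max_excess_demand_face(z, tol=1e-12):
    """Return the (0-based) goods on which some q in F(p) may put weight.

    z is the excess demand vector z(p) at an interior price p.
    Any q in the simplex supported on these goods maximizes z . q.
    """
    z_max = max(z)
    return [l for l, z_l in enumerate(z) if z_l >= z_max - tol]


def value(z, q):
    """Value z . q of the excess demand vector at the price vector q."""
    return sum(z_l * q_l for z_l, q_l in zip(z, q))


# Hypothetical excess demand satisfying Walras' Law at p = (0.5, 0.25, 0.25).
z = [-1.0, 1.0, 1.0]
support = max_excess_demand_face(z)   # the two goods tied for the maximum
q_star = [0.0, 0.5, 0.5]              # one element of F(p)
q_other = [0.2, 0.4, 0.4]             # puts weight on a non-maximal good
print(support, value(z, q_star), value(z, q_other))
```

When $z(p) = 0$ every good is tied for the maximum, so the function returns all indices, which matches $F(p) = S$.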

Now we want to show that the correspondence $F$ is an upper hemi-continuous correspondence
with nonempty, convex, and compact values. First, note that by construction,
the correspondence $F$ is clearly compact and convex-valued. $F(p)$ is also non-empty
for all $p \in S$. Indeed, when $p \in \mathrm{int}\,S$, any price vector $q$ that maximizes the value of
$q' \cdot z(p)$ is in $F(p)$ and so it is not empty. When $p$ is on the boundary of $S$, $p^l = 0$ for at
least some good $l$ and thus there exists a $q \in S$ such that $p \cdot q = 0$, which implies $F(p)$ is
also non-empty.

Now we show the correspondence $F$ is upper hemi-continuous, or equivalently that it has
closed graph, i.e., for any sequences $p_k \to p$ and $q_k \to q$ with $q_k \in F(p_k)$ for all $k$, we have
$q \in F(p)$. There are two cases to consider.

Case 1. $p \in \mathrm{int}\,S$. Then $p_k \in \mathrm{int}\,S$ for $k$ sufficiently large. From $q_k \cdot z(p_k) \ge q' \cdot z(p_k)$
for all $q' \in S$ and the continuity of $z(\cdot)$, we get $q \cdot z(p) \ge q' \cdot z(p)$ for all $q' \in S$,
i.e., $q \in F(p)$.

Case 2. $p$ is a boundary point of $S$. Take any $l$ with $p^l > 0$. We shall argue that
for $k$ sufficiently large, we have $q_k^l = 0$, and therefore it must be that $q^l = 0$; from this
$q \in F(p)$ follows. Because $p^l > 0$, there is an $\epsilon > 0$ such that $p_k^l > \epsilon$ for $k$ sufficiently
large. If, in addition, $p_k$ is on the boundary of $S$, then $q_k^l = 0$ by the definition of $F(p_k)$.
If, instead, $p_k \in \mathrm{int}\,S$, then by conditions (iv) and (v), for $k$ sufficiently large, we must
have

$$z^l(p_k) < \max\{z^1(p_k), \ldots, z^L(p_k)\}$$

and therefore that, again, $q_k^l = 0$. To prove the above inequality, note that by condition
(v) the right-hand side of the above expression goes to infinity with $k$ (because $p$ is a
boundary point of $S$, some prices go to zero as $k \to \infty$). But the left-hand side is
bounded above because if it is positive then

$$z^l(p_k) \le \frac{1}{\epsilon}\, p_k^l z^l(p_k) = -\frac{1}{\epsilon} \sum_{l' \ne l} p_k^{l'} z^{l'}(p_k) < \frac{s}{\epsilon} \sum_{l' \ne l} p_k^{l'} < \frac{s}{\epsilon},$$

where $s$ is the bound in excess supply given by condition (iv). In summary, for $p_k$ close
enough to the boundary of $S$, the maximal demand corresponds to some of the commodities
whose price is close to zero. Therefore, we conclude that, for large $k$, any $q_k \in F(p_k)$ will
put nonzero weight only on commodities whose prices approach zero. But this guarantees
$p \cdot q = 0$ and so $q \in F(p)$. So $F$ must be upper hemi-continuous.

Thus the correspondence $F$ is an upper hemi-continuous correspondence with nonempty,
convex, and compact values. Therefore, by Kakutani's fixed point theorem (see section
1.2.8), we conclude that there is $p^* \in S$ with $p^* \in F(p^*)$.

Finally, we show that any fixed point $p^*$ of $F$ is a competitive equilibrium. Suppose
that $p^* \in F(p^*)$. Then $p^*$ cannot be a boundary point of $S$ because $p \cdot p > 0$ and $p \cdot q = 0$
for all $q \in F(p)$ cannot occur simultaneously, and thus $p^* \in \mathrm{int}\,S$. If $z(p^*) \ne 0$, then
$z^l(p^*) < 0$ for some $l$ and $z^k(p^*) > 0$ for some $k$ by the interior Walras' Law. Thus, for such a $p^*$,
any $q^* \in F(p^*)$ must have $q^{*l} = 0$ because $q^*$ maximizes the function $z(p^*) \cdot q$ over $S$,
which means $F(p^*)$ is a subset of boundary points of $S$. Hence, if $p^* \in F(p^*)$, we must
have $z(p^*) = 0$. The proof is completed.

As shown by Tian (2009), Theorem 7.4.3 can also be extended to the case of any set, 
especially the positive price open set, of price systems for which excess demand is defined. 
To do so, we introduce the following version of recursive transfer lower semi-continuity. 

Definition 7.4.4 Let $D$ be a subset of $\mathrm{int}\,S^{L-1}$. An excess demand function $z(\cdot) :
\mathrm{int}\,S^{L-1} \to \mathbb{R}^L$ is said to be recursively transfer lower semi-continuous on $\mathrm{int}\,S^{L-1}$ with
respect to $D$ if, whenever $q \in \mathrm{int}\,S^{L-1}$ is not a competitive equilibrium price system, there
exists some price system $p^0 \in \mathrm{int}\,S^{L-1}$ (possibly $p^0 = q$) and a neighborhood $V_q$ such that
(1) whenever $p^0$ is upset by a price system in $\mathrm{int}\,S^{L-1} \setminus D$, it is upset by a price system
in $D$, and (2) $p \cdot z(V_q) > 0$ for any $p \in D$ that recursively upsets $p^0$.

Now we have the following theorem that fully characterizes the existence of competitive 
equilibrium in economies with possibly indivisible commodity spaces and discontinuous 
excess demand functions. 

Theorem 7.4.5 (The Existence Theorem I'''') Suppose an excess demand function
$z(\cdot) : \mathrm{int}\,S^{L-1} \to \mathbb{R}^L$ satisfies interior Walras' law: $p \cdot z(p) = 0$ for all $p \in \mathrm{int}\,S^{L-1}$.
Then there is a competitive price equilibrium $p^* \in \mathrm{int}\,S^{L-1}$ if and only if there exists a
compact subset $D \subseteq \mathrm{int}\,S^{L-1}$ such that $z(\cdot)$ is recursively transfer lower semi-continuous
on $\mathrm{int}\,S^{L-1}$ with respect to $D$.

Proof. Sufficiency ($\Leftarrow$). The proof of sufficiency is essentially the same as that of
Theorem 7.4.3 and we just outline the proof here. To show the existence of a competitive
equilibrium on $S^{L-1}$, it suffices to show that there exists a competitive equilibrium $p^*$ in
$D$ if $z$ is recursively transfer lower semi-continuous on $S^{L-1}$ with respect to $D$.
Suppose, by way of contradiction, that there is no competitive equilibrium in $D$. Then,
since $z$ is recursively transfer lower semi-continuous on $S^{L-1}$ with respect to $D$,
for each $q \in D$, there exists $p^0$ and a neighborhood $V_q$ such that (1) whenever $p^0$ is upset
by a price system in $S^{L-1} \setminus D$, it is upset by a price system in $D$ and (2) $p \cdot z(V_q) > 0$
for any finite subset of price systems $\{p^1, \ldots, p^m\} \subset D$ with $p^m = p$ and $p \cdot z(p^{m-1}) > 0$,
$p^{m-1} \cdot z(p^{m-2}) > 0$, $\ldots$, $p^1 \cdot z(p^0) > 0$ for $m \ge 1$. Since there is no competitive equilibrium
by the contradiction hypothesis, $p^0$ is not a competitive equilibrium and thus, by recursive
transfer lower semi-continuity on $S^{L-1}$ with respect to $D$, such a sequence of
recursive securing price systems $\{p^1, \ldots, p^{m-1}, p\}$ exists for some $m \ge 1$.

Since $D$ is compact and $D \subset \bigcup_{q \in S^{L-1}} V_q$, there is a finite set $\{q^1, \ldots, q^T\} \subset D$ such
that $D \subset \bigcup_{i=1}^{T} V_{q^i}$. For each such $q^i$, the corresponding initial deviation price system
is denoted by $p^{0i}$, so that $p^i \cdot z(V_{q^i}) > 0$ whenever $p^{0i}$ is recursively upset by $p^i$ through
any finite subset of securing price systems $\{p^{1i}, \ldots, p^{mi}\} \subset D$ with $p^{mi} = p^i$. Then, by
the same argument as in the proof of Theorem 7.4.3, we obtain that $p^k$ is not in the
union of $V_{q^1}, V_{q^2}, \ldots, V_{q^k}$ for $k = 1, 2, \ldots, T$. For $k = T$, we have $p^T \notin V_{q^1} \cup V_{q^2} \cup \ldots \cup V_{q^T}$
and so $p^T \notin D \subset \bigcup_{i=1}^{T} V_{q^i}$, which contradicts the fact that $p^T$ is an upsetting price in $D$.

Thus, there exists a price system $p^* \in S^{L-1}$ such that $p \cdot z(p^*) \le 0$ for all $p \in \mathrm{int}\,S^{L-1}$.
We want to show that $p^*$ in fact is a competitive price equilibrium. Note that $\mathrm{int}\,S^{L-1}$
is open and $D$ is a compact subset of $\mathrm{int}\,S^{L-1}$. One can always find a sequence of price
vectors $\{q_n^l\} \subset \mathrm{int}\,S^{L-1} \setminus D$ such that $q_n^l \to p^l$ as $n \to \infty$, where $p^l = (0, \ldots, 0, 1, 0, \ldots, 0)$
is the unit vector that has only one argument, the $l$th argument, with value 1 and others
with value 0. Since $p \cdot z(q)$ is continuous in $p$, we have $z^l(p^*) \le 0$ for $l = 1, \ldots, L$ and
thus $p^*$ is a competitive price equilibrium.

Necessity ($\Rightarrow$). Suppose $p^*$ is a competitive equilibrium. Let $D = \{p^*\}$. Then, the set
$D$ is clearly compact. Now, for any non competitive equilibrium $q \in S^{L-1}$, let $p^0 = p^*$
and $V_q$ be a neighborhood of $q$. Since $p \cdot z(p^*) \le 0$ for all $p \in S^{L-1}$ and $p^0 = p^*$ is the
unique element in $D$, there is no other upsetting price $p^1$ such that $p^1 \cdot z(p^0) > 0$. Hence,
the excess demand function is recursively transfer lower semi-continuous on $S^{L-1}$ with respect to $D$. ■

Theorem 7.4.5 then strictly generalizes all the existing results on the existence of 
competitive equilibrium in economies with single-valued excess demand functions. 

• Conditions for Walras' Law to be true 

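From the above theorems, Walras' Law is important for proving the existence of a competitive
equilibrium. Under which conditions does Walras' Law hold?
When each consumer's budget constraint holds with equality:

$$p x_i(p) = p w_i + \sum_{j=1}^{J} \theta_{ij}\, p y_j(p)$$

for all $i$, we have

$$\sum_{i=1}^{n} p x_i(p) = \sum_{i=1}^{n} p w_i + \sum_{i=1}^{n} \sum_{j=1}^{J} \theta_{ij}\, p y_j(p) = \sum_{i=1}^{n} p w_i + \sum_{j=1}^{J} p y_j(p)$$

(the last equality uses $\sum_{i=1}^{n} \theta_{ij} = 1$ for each $j$), which implies that

$$p \left[ \sum_{i=1}^{n} x_i(p) - \sum_{i=1}^{n} w_i - \sum_{j=1}^{J} y_j(p) \right] = 0 \qquad (7.22)$$

so that

$$p \cdot z(p) = 0 \quad \text{(Walras' Law)} \qquad (7.23)$$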

Thus, as long as the budget line holds with equality and aggregate excess demand is 
well-defined on the domain of prices, strong Walras' Law must hold. 
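The derivation above can be checked numerically. The sketch below is only an illustration under stated assumptions: it uses a pure exchange economy (no production, so the $y_j$ terms drop out) with two Cobb-Douglas consumers, and the parameter values and function names are hypothetical. Because each consumer spends all of his wealth, $p \cdot z(p)$ should vanish at every strictly positive price vector, not only at equilibrium.

```python
def cobb_douglas_demand(a, wealth, p):
    """Demand of a consumer with utility (x1)^a (x2)^(1-a) at prices p."""
    return [a * wealth / p[0], (1.0 - a) * wealth / p[1]]


def excess_demand(p, shares, endowments):
    """Aggregate excess demand z(p) of a pure exchange economy."""
    z = [0.0, 0.0]
    for a, w in zip(shares, endowments):
        wealth = p[0] * w[0] + p[1] * w[1]      # budget holds with equality
        x = cobb_douglas_demand(a, wealth, p)
        z[0] += x[0] - w[0]
        z[1] += x[1] - w[1]
    return z


def walras_value(p, shares, endowments):
    """Value p . z(p), which Walras' Law says is zero."""
    z = excess_demand(p, shares, endowments)
    return p[0] * z[0] + p[1] * z[1]


shares = [0.3, 0.6]                        # illustrative Cobb-Douglas exponents
endowments = [(1.0, 0.0), (0.0, 1.0)]
for p in [(1.0, 0.5), (1.0, 2.0), (0.2, 0.8)]:
    # Individual markets need not clear, but the value of z(p) is zero
    # up to floating-point rounding.
    print(p, excess_demand(p, shares, endowments), walras_value(p, shares, endowments))
```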

The above existence theorems on competitive equilibrium are based on the assumptions
that the aggregate excess demand correspondence is single-valued and satisfies the
strong Walras's Law. The question is: under what conditions on economic environments
does the budget constraint hold with equality for all optimal consumption bundles, and
when is the aggregate excess demand correspondence single-valued or convex-valued? The following
types of monotonicity and convexity of preferences, listed from the strongest
to the weakest, may be used to answer these questions.

• Types of monotonicity conditions

(1) Strict monotonicity: For any two consumption bundles, $x \ge x'$
with $x \ne x'$ implies $x \succ_i x'$.

(2) Monotonicity: $x > x'$ implies that $x \succ_i x'$.

(3) Local non-satiation: For any point $x$ and any neighborhood $N(x)$, there
is $x' \in N(x)$ such that $x' \succ_i x$.

(4) Non-satiation: For any $x$, there exists $x'$ such that $x' \succ_i x$.

Remark 7.4.5 Monotonicity of preferences can be interpreted as individuals' desires for 
goods: the more, the better. Local non-satiation means individuals' desires are unlimited. 

• Types of convexities 

(i) Strict convexity: For any $x$ and $x'$ with $x \succsim_i x'$ and $x \ne x'$, $x_\lambda \equiv \lambda x + (1 - \lambda) x' \succ_i x'$ for $\lambda \in (0, 1)$.

Figure 7.10: Strictly convex indifference curves.

(ii) Convexity: If $x \succ_i x'$, then $x_\lambda = \lambda x + (1 - \lambda) x' \succ_i x'$ for $\lambda \in (0, 1)$.

Figure 7.11: Linear indifference curves are convex, but not strictly convex.

(iii) Weak convexity: If $x \succsim_i x'$, then $x_\lambda \succsim_i x'$.




Figure 7.12: "Thick" indifference curves are weakly convex, but not convex. 

The above conditions are ordered by increasing weakness, i.e., each of the above conditions
implies the next one. The converse may not be true, as can be seen by comparing the
strictly convex, linear, and thick indifference curves shown in the
above figures.

Remark 7.4.6 The convexity of preferences implies that people want to diversify their
consumptions, and thus, convexity can be viewed as the formal expression of basic measure
of economic markets for diversification. Note that the strict convexity of $\succ_i$ implies the
conventional diminishing marginal rates of substitution (MRS), and weak convexity of $\succsim_i$
is equivalent to the quasi-concavity of utility function $u_i$. Also notice that the continuity of
$\succsim_i$ is a sufficient condition for the continuous utility representations, that is, it guarantees
the existence of a continuous utility function $u_i(\cdot)$.

Remark 7.4.7 Under the convexity of preferences $\succsim_i$, non-satiation implies local non-satiation.
Why? The proof is left to readers.

Now we are ready to answer under which conditions strong Walras's Law holds and under which
conditions a demand correspondence is a function or is convex-valued. The following propositions
answer these questions.

Proposition 7.4.1 Under the local non-satiation assumption, the budget constraint
holds with equality, and thus Walras's Law holds.

Proposition 7.4.2 Under the strict convexity of $\succsim_i$, $x_i(p)$ becomes a (single-valued) function.

Proposition 7.4.3 Under the weak convexity of preferences, the demand correspondence
$x_i(p)$ is convex-valued.

Strict convexity of production set: if $y_j^1 \in Y_j$ and $y_j^2 \in Y_j$, then the convex combination
$\lambda y_j^1 + (1 - \lambda) y_j^2 \in \mathrm{int}\,Y_j$ for all $0 < \lambda < 1$, where $\mathrm{int}\,Y_j$ denotes the interior points of $Y_j$.
The proof of the following proposition is based on the maximum theorem.

Proposition 7.4.4 If $Y_j$ is compact (i.e., closed and bounded) and strictly convex, then
the supply correspondence $y_j(p)$ is a well defined single-valued and continuous function.

Proof: By the maximum theorem, we know that $y_j(p)$ is a non-empty valued upper
hemi-continuous correspondence by the compactness of $Y_j$ (by noting that $0 \in Y_j$) for all
$p \in \mathbb{R}^L_+$.

Now we show it is single-valued. Suppose not. Then $y_j^1$ and $y_j^2$ are two profit maximizing
production plans for $p \in \mathbb{R}^L_+$, and thus $p y_j^1 = p y_j^2$. Then, by the strict convexity of $Y_j$, we
have $\lambda y_j^1 + (1 - \lambda) y_j^2 \in \mathrm{int}\,Y_j$ for all $0 < \lambda < 1$. Therefore, there exists some $t > 1$ such
that

$$t[\lambda y_j^1 + (1 - \lambda) y_j^2] \in \mathrm{int}\,Y_j. \qquad (7.24)$$

Then $t[\lambda p y_j^1 + (1 - \lambda) p y_j^2] = t p y_j^1 > p y_j^1$, which contradicts the fact that $y_j^1$ is a profit
maximizing production plan.

So $y_j(p)$ is a single-valued function. Thus, by the upper hemi-continuity of $y_j(p)$, we
know it is a single-valued and continuous function.

Figure 7.13: The upper contour set $U_w(x_i)$ is given by all points above the indifference
curve, and the lower contour set $L_w(x_i)$ is given by all points below the indifference curve
in the figure.

Proposition 7.4.5 If $\succsim_i$ is continuous, strictly convex, locally non-satiated, and $w_i > 0$,
then $x_i(p)$ is a continuous single-valued function and satisfies the budget constraint with
equality for all $p \in \mathbb{R}^L_{++}$. Consequently, (strong) Walras's Law is satisfied for all $p \in \mathbb{R}^L_{++}$.

Proof. First note that, since $w_i > 0$, one can show that the budget constrained set $B_i(p)$
is a continuous correspondence with non-empty and compact values and $\succsim_i$ is continuous.
Then, by the maximum theorem, we know the demand correspondence $x_i(p)$ is upper
hemi-continuous. Furthermore, by the strict convexity of preferences, it is single-valued
and continuous. Finally, by local non-satiation, we know the budget constraint holds with
equality, and thus strong Walras's Law is satisfied.

Then, from the above propositions, we can have the following existence theorem that
provides sufficient conditions directly based on the fundamentals of the economy by applying
the Existence Theorem I''' above.

Theorem 7.4.6 For a private ownership economy $e = (\{X_i, w_i, \succsim_i\}, \{Y_j\}, \{\theta_{ij}\})$, there
exists a competitive equilibrium if the following conditions hold

(i) $X_i = \mathbb{R}^L_+$;

(ii) $\succsim_i$ are continuous, strictly convex (which guarantees that the demand function
is single-valued) and strictly monotonic (which guarantees that strong Walras'
Law holds);

(iii) $w_i > 0$;

(iv) $Y_j$ are compact, strictly convex, $0 \in Y_j$, $j = 1, 2, \ldots, J$.

Note that $\succsim_i$ are continuous if the upper contour set $U_w(x_i) \equiv \{x_i' \in X_i \text{ and } x_i' \succsim_i x_i\}$
and the lower contour set $L_w(x_i) \equiv \{x_i' \in X_i \text{ and } x_i' \precsim_i x_i\}$ are closed.

Proof: By the assumptions imposed, we know that $x_i(p)$ and $y_j(p)$ are continuous and
single-valued. Thus the aggregate excess demand function is a continuous single-valued
function and satisfies interior Walras' Law by monotonicity of preferences. Thus, we only
need to show that conditions (iv) and (v) in Theorem I''' are also satisfied. The bound in
(iv) follows from the nonnegativity of demand (i.e., the fact that $X_i = \mathbb{R}^L_+$) and bounded
production sets, which implies that a consumer's total net supply to the market of any
good $l$ can be no greater than the sum of his initial endowment and the upper bound of the production
sets. Finally, we show that condition (v) is satisfied. As some prices go to zero, a consumer
whose wealth tends to a strictly positive limit (note that, because $pw > 0$, there must be
at least one such consumer) and with strict monotonicity of preferences will demand an
increasingly large amount of some of the commodities whose prices go to zero. Hence, by
Theorem I''', there is a competitive equilibrium. The proof is completed.

Examples of Computing CE 

Example 7.4.1 Consider an exchange economy with two consumers and two goods with

$$w_1 = (1, 0), \qquad w_2 = (0, 1)$$

$$u_1(x_1) = (x_1^1)^a (x_1^2)^{1-a}, \quad 0 < a < 1 \qquad (7.25)$$
$$u_2(x_2) = (x_2^1)^b (x_2^2)^{1-b}, \quad 0 < b < 1$$

Let $p = p^2 / p^1$.
Consumer 1's problem is to solve

$$\max_{x_1} u_1(x_1) \qquad (7.26)$$

subject to

$$x_1^1 + p x_1^2 = 1 \qquad (7.27)$$

Since utility functions are of the Cobb-Douglas type, the solution is given by

$$x_1^1(p) = \frac{a}{1} = a \qquad (7.28)$$
$$x_1^2(p) = \frac{1 - a}{p}. \qquad (7.29)$$

Consumer 2's problem is to solve

$$\max_{x_2} u_2(x_2) \qquad (7.30)$$

subject to

$$x_2^1 + p x_2^2 = p. \qquad (7.31)$$

The solution is given by

$$x_2^1(p) = \frac{b \cdot p}{1} = b \cdot p \qquad (7.32)$$
$$x_2^2(p) = \frac{(1 - b) p}{p} = (1 - b). \qquad (7.33)$$

Then, by the market clearing condition,

$$x_1^1(p) + x_2^1(p) = 1 \;\Rightarrow\; a + bp = 1 \qquad (7.34)$$

and thus the competitive equilibrium is given by

$$p = \frac{p^2}{p^1} = \frac{1 - a}{b}. \qquad (7.35)$$

This is true because, by Walras's Law, for $L = 2$, it is enough to show that only one market
clears.
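As a quick cross-check of Example 7.4.1, the following Python sketch evaluates the excess demand in both markets at the closed-form price $p = (1-a)/b$. It is only an illustration: the function names and the parameter values $a = 0.3$, $b = 0.6$ are assumptions chosen for the example, and good 1 is taken as numeraire ($p^1 = 1$) as in the text.

```python
def excess_demand_good1(p, a, b):
    """Excess demand for good 1: x_1^1 + x_2^1 - 1 = a + b*p - 1."""
    return a + b * p - 1.0


def excess_demand_good2(p, a, b):
    """Excess demand for good 2: x_1^2 + x_2^2 - 1 = (1 - a)/p + (1 - b) - 1."""
    return (1.0 - a) / p + (1.0 - b) - 1.0


def equilibrium_price(a, b):
    """Closed-form equilibrium relative price p = p2/p1 from (7.35)."""
    return (1.0 - a) / b


a, b = 0.3, 0.6          # illustrative values with 0 < a, b < 1
p_star = equilibrium_price(a, b)
# Both markets clear at p_star (up to rounding), as Walras' Law predicts.
print(p_star, excess_demand_good1(p_star, a, b), excess_demand_good2(p_star, a, b))
```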

Remark 7.4.8 Since the Cobb-Douglas utility function is widely used as an example of
utility functions that have nice properties such as strict monotonicity on $\mathbb{R}^L_{++}$, continuity,
and strict quasi-concavity, it is useful to remember the functional form of the demand
function derived from the Cobb-Douglas utility functions. It may be remarked that we
can easily derive the demand function for the general function $u_i(x_i) = (x_i^1)^\alpha (x_i^2)^\beta$, $\alpha > 0$, $\beta > 0$,
by a suitable monotonic transformation. Indeed, by invariance to monotonic
transformation of the utility function, we can rewrite the utility function as

$$[(x_i^1)^\alpha (x_i^2)^\beta]^{\frac{1}{\alpha + \beta}} = (x_i^1)^{\frac{\alpha}{\alpha + \beta}} (x_i^2)^{\frac{\beta}{\alpha + \beta}} \qquad (7.36)$$

so that we have

$$x_i^1(p) = \frac{\frac{\alpha}{\alpha + \beta} I}{p^1} \qquad (7.37)$$

and

$$x_i^2(p) = \frac{\frac{\beta}{\alpha + \beta} I}{p^2} \qquad (7.38)$$

when the budget line is given by

$$p^1 x_i^1 + p^2 x_i^2 = I. \qquad (7.39)$$
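The demand formulas (7.37) and (7.38) can be sanity-checked against a brute-force search along the budget line. The sketch below is a hedged illustration only: the function names, the grid resolution, and the numbers ($\alpha = 2$, $\beta = 3$, $p^1 = 1$, $p^2 = 2$, $I = 10$) are assumptions made for the example.

```python
def cobb_douglas_demand(alpha, beta, p1, p2, income):
    """Demand from (7.37)-(7.38) for u = (x1)^alpha (x2)^beta."""
    share = alpha / (alpha + beta)
    return share * income / p1, (1.0 - share) * income / p2


def utility(alpha, beta, x1, x2):
    """Cobb-Douglas utility before any monotonic transformation."""
    return (x1 ** alpha) * (x2 ** beta)


def best_on_grid(alpha, beta, p1, p2, income, steps=10000):
    """Brute-force search over bundles on the budget line, as a cross-check."""
    best_x1, best_u = None, -1.0
    for k in range(1, steps):
        x1 = (income / p1) * k / steps
        x2 = (income - p1 * x1) / p2
        u = utility(alpha, beta, x1, x2)
        if u > best_u:
            best_x1, best_u = x1, u
    return best_x1, (income - p1 * best_x1) / p2


formula = cobb_douglas_demand(2.0, 3.0, 1.0, 2.0, 10.0)
search = best_on_grid(2.0, 3.0, 1.0, 2.0, 10.0)
print(formula, search)
```

Passing the normalized exponents $\alpha/(\alpha+\beta)$ and $\beta/(\alpha+\beta)$ instead of $\alpha$ and $\beta$ gives the same demand, which is the invariance to monotonic transformation used in the remark.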

Example 7.4.2

$$n = 2, \qquad L = 2$$
$$w_1 = (1, 0), \qquad w_2 = (0, 1)$$
$$u_1(x_1) = (x_1^1)^a (x_1^2)^{1-a}, \quad 0 < a < 1$$
$$u_2(x_2) = \min\{x_2^1, b x_2^2\} \quad \text{with } b > 0 \qquad (7.40)$$

For consumer 1, we have already obtained

$$x_1^1(p) = a, \qquad x_1^2(p) = \frac{1 - a}{p}. \qquad (7.41)$$

For Consumer 2, his problem is to solve:

$$\max_{x_2} u_2(x_2) \qquad (7.42)$$

s.t.

$$x_2^1 + p x_2^2 = p. \qquad (7.43)$$

At the optimal consumption, we have

$$x_2^1 = b x_2^2. \qquad (7.44)$$

By substituting the solution into the budget equation, we have $b x_2^2 + p x_2^2 = p$ and thus
$x_2^2(p) = \frac{p}{b + p}$ and $x_2^1(p) = \frac{bp}{b + p}$.

Then, by $x_1^1(p) + x_2^1(p) = 1$, we have

$$a + \frac{bp}{b + p} = 1 \qquad (7.45)$$

or

$$(1 - a)(b + p) = bp \qquad (7.46)$$

so that

$$(a + b - 1) p = b(1 - a). \qquad (7.47)$$

Thus,

$$p^* = \frac{b(1 - a)}{a + b - 1}.$$

To make $p^*$ a competitive equilibrium price, we need to assume $a + b > 1$.

Note that there is no competitive equilibrium when $a + b \le 1$ although the aggregate
excess demand function is continuous on the domain. What causes the nonexistence of
competitive equilibrium and which condition cannot be dropped?
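The role of the condition $a + b > 1$ in Example 7.4.2 can be explored numerically. The sketch below is an illustration under assumptions: the function names and parameter values are hypothetical, good 1 is the numeraire, and only strictly positive relative prices $p = p^2/p^1$ are considered.

```python
def candidate_price(a, b):
    """Solve (a + b - 1) p = b (1 - a) for p; None when a + b == 1."""
    if a + b == 1.0:
        return None                      # (7.47) has no solution
    return b * (1.0 - a) / (a + b - 1.0)


def excess_demand_good1(p, a, b):
    """Excess demand for good 1: a + b*p/(b + p) - 1, with p = p2/p1."""
    return a + b * p / (b + p) - 1.0


# With a + b > 1 the candidate price is positive and clears the market.
p_star = candidate_price(0.5, 1.0)
print(p_star, excess_demand_good1(p_star, 0.5, 1.0))

# With a + b < 1 the formula gives a negative number, and the market for
# good 1 stays in excess supply at every positive relative price tried.
print(candidate_price(0.3, 0.5))
print([excess_demand_good1(p, 0.3, 0.5) for p in (0.01, 1.0, 100.0)])
```

The second case reflects the bound $a + bp/(b+p) < a + b$, which holds for every $p > 0$.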

7.4.2 The Existence of CE for Aggregate Excess Demand Correspondences

When preferences and/or production sets are not strictly convex, the demand correspondence
and/or supply correspondence may not be single-valued, and consequently the
aggregate excess demand correspondence may not be single-valued. As a result, one cannot
use the above existence results to argue the existence of competitive equilibrium.
Nevertheless, by using the KKM lemma, we can still prove the existence of competitive
equilibrium when the aggregate excess demand correspondence satisfies certain conditions.

Theorem 7.4.7 (The Existence Theorem II) For a private ownership economy $e = (\{X_i, w_i, \succsim_i\}, \{Y_j\}, \{\theta_{ij}\})$, if $z(p)$ is a non-empty, convex and compact-valued upper
hemi-continuous correspondence and satisfies weak Walras' Law, then there exists a competitive
equilibrium, that is, there is a price vector $p^* \in S$ such that

$$z(p^*) \cap \{-\mathbb{R}^L_+\} \ne \emptyset. \qquad (7.48)$$

Proof. Define a correspondence $F : S \to 2^S$ by

$$F(q) = \{p \in S : qz \le 0 \text{ for some } z \in z(p)\}.$$

Since $z(\cdot)$ is upper hemi-continuous, $F(q)$ is closed for each $q \in S$. We now prove $F$ is
FS-convex. Suppose, by way of contradiction, that there are some $q_1, \ldots, q_m \in S$ and
some convex combination $q_\lambda = \sum_{t=1}^{m} \lambda_t q_t$ such that $q_\lambda \notin \bigcup_{t=1}^{m} F(q_t)$. Then, $q_\lambda \notin F(q_t)$
for all $t = 1, \ldots, m$. Thus, for all $z \in z(q_\lambda)$, we have $q_t z > 0$ for $t = 1, \ldots, m$. Hence,
$\sum_{t=1}^{m} \lambda_t q_t z = q_\lambda z > 0$, which contradicts the fact that $z$ satisfies Walras' Law. So $F$ must
be FS-convex. Therefore, by the KKM lemma, we have

$$\bigcap_{q \in S} F(q) \ne \emptyset.$$

Then there exists a $p^* \in S$ such that $p^* \in \bigcap_{q \in S} F(q)$, i.e., $p^* \in F(q)$ for all $q \in S$.
Thus, for each $q \in S$, there is $z_q \in z(p^*)$ such that

$$q z_q \le 0.$$

We now prove $z(p^*) \cap \{-\mathbb{R}^L_+\} \ne \emptyset$. Suppose not. Since $z(p^*)$ is convex and compact and
$-\mathbb{R}^L_+$ is convex and closed, by the Separating Hyperplane theorem, there exists a vector
$q \in \mathbb{R}^L$ and some $c \in \mathbb{R}$ such that

$$q \cdot (-\mathbb{R}^L_+) < c < q z(p^*).$$

Since $(-\mathbb{R}^L_+)$ is a cone, we must have $c > 0$ and $q \cdot (-\mathbb{R}^L_+) \le 0$. Thus, $q \in \mathbb{R}^L_+$ and
$qz > 0$ for all $z \in z(p^*)$, which contradicts the existence of $z_q \in z(p^*)$ with $q z_q \le 0$. The proof is completed.

Remark 7.4.9 The last part of the proof can also be shown by applying the following
result: Let $K$ be a compact convex set. Then $K \cap \{-\mathbb{R}^L_+\} \ne \emptyset$ if and only if for any
$p \in \mathbb{R}^L_+$, there exists $z \in K$ such that $pz \le 0$. The proof of this result can be found, for
example, in the book of K. Border (1985, p. 13).

Similarly, we have the following existence theorem that provides sufficient conditions 
directly based on economic environments by applying the Existence Theorem II above. 

Theorem 7.4.8 For a private ownership economy $e = (\{X_i, w_i, \succsim_i\}, \{Y_j\}, \{\theta_{ij}\})$, there
exists a competitive equilibrium if the following conditions hold

(i) $X_i = \mathbb{R}^L_+$;

(ii) $\succsim_i$ are continuous, weakly convex and strictly monotone;

(iii) $w > 0$;

(iv) $Y_j$ are closed, convex, and $0 \in Y_j$; $j = 1, 2, \ldots, J$.

7.4.3 The Existence of CE for General Production Economies

For a general private ownership production economy,

$$e = (\{X_i, \succsim_i, w_i\}, \{Y_j\}, \{\theta_{ij}\}) \qquad (7.49)$$

recall that a competitive equilibrium consists of a feasible allocation $(x^*, y^*)$ and a price
vector $p^* \in \mathbb{R}^L_+$ such that

(i) $x_i^* \in D_i(p^*) \equiv x_i(p^*)$ (utility maximization, $i = 1, 2, \ldots, n$)

(ii) $y_j^* \in S_j(p^*) \equiv y_j(p^*)$ (profit maximization, $j = 1, 2, \ldots, J$)

We now state the following existence theorem for general production economies without
proof. Since the proof is very complicated, we refer readers to the proof
in Debreu (1959), who used the existence theorem on equilibrium of the abstract economy
discussed in Section 7.7.

Theorem 7.4.9 (Existence Theorem III, Debreu, 1959) A competitive equilibrium
for the private-ownership economy $e$ exists if the following conditions are satisfied:

(1) $X_i$ is closed, convex and bounded from below;

(2) $\succsim_i$ are non-satiated;

(3) $\succsim_i$ are continuous;

(4) $\succsim_i$ are convex;

(5) $w_i \in \mathrm{int}\,X_i$;

(6) $0 \in Y_j$ (possibility of inaction);

(7) $Y_j$ are closed and convex (continuity and no IRS);

(8) $Y_j \cap \{-Y_j\} = \{0\}$ (irreversibility);

(9) $\{-\mathbb{R}^L_+\} \subseteq Y_j$ (free disposal).

7.5 The Uniqueness of Competitive Equilibria 

So now we know that a Walrasian equilibrium will exist under some regularity conditions
such as continuity, monotonicity, and convexity of preferences and/or production possibility
sets. We worry next about the other extreme possibility: for a given economy there
are many equilibria, so that we do not know what the outcome of the market process is.
We can easily give examples in which there are multiple competitive equilibrium price
vectors. When is there only one normalized price vector that clears all markets?

The free goods case is not of great interest here, so we will rule it out by means of
the desirability assumption so that every equilibrium price of each good must be strictly
positive. We also want to assume the continuous differentiability of the aggregate excess
demand function. The reason is fairly clear: if indifference curves have kinks in them, we
can find whole ranges of prices that are market equilibria. Not only are the equilibria not
unique, they are not even locally unique.

Thus, we answer this question only for the case in which $p^* > 0$ and $z(p)$ is
differentiable.

Theorem 7.5.1 Suppose all goods are desirable and gross substitutes for all prices (i.e.,
$\frac{\partial z^h(p)}{\partial p^l} > 0$ for $l \ne h$). If $p^*$ is a competitive equilibrium price vector and Walras' Law
holds, then it is the unique competitive equilibrium price vector.

Proof: By the desirability, $p^* > 0$. Suppose $p$ is another competitive equilibrium price
vector that is not proportional to $p^*$. Let $m = \max_l \frac{p^l}{p^{*l}} = \frac{p^k}{p^{*k}}$ for some $k$. By homogeneity
and the definition of competitive equilibrium, we know that $z(p^*) = z(mp^*) = 0$. We
know that $m = \frac{p^k}{p^{*k}} \ge \frac{p^l}{p^{*l}}$ for all $l = 1, \ldots, L$ and $m > \frac{p^h}{p^{*h}}$ for some $h$. Then we have
$mp^{*l} \ge p^l$ for all $l$ and $mp^{*h} > p^h$ for some $h$. Thus, in moving from $mp^*$ to $p$, the price of good $k$ is fixed
while the prices of the other goods go down. The demand for good $k$ must then go down by
gross substitutability. Hence, we have $z^k(p) < 0$, a contradiction.

When the aggregate demand function satisfies the Weak Axiom of Revealed Preference
(WARP) and Walras' Law holds, competitive equilibrium is also unique.

The Weak Axiom of Revealed Preference (WARP) of the aggregate excess
demand function: If $pz(p) \ge pz(p')$, then $p'z(p) > p'z(p')$ for all $p, p' \in \mathbb{R}^L_+$.

Figure 7.14: The figure shows an aggregate demand function that satisfies WARP.

WARP implies that, if $z(p')$ could have been bought at $p$ where $z(p)$ was bought (so
that $z(p) \succ z(p')$ since $z(p)$ is the optimal choice), then at price $p'$, $z(p)$ is outside the
budget constraint (otherwise it contradicts the fact that $z(p)$ is the optimal choice).
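The WARP condition can be tested pairwise on a finite set of prices. The sketch below is a hedged illustration, not a general test: it uses the excess demand of a single Cobb-Douglas consumer (an assumption, chosen because a utility-maximizing individual demand satisfies WARP), the names `warp_holds` and `excess_demand` are hypothetical, and pairs with identical excess demand are skipped, which is the usual qualifier that applies when $p'$ is proportional to $p$.

```python
def dot(u, v):
    """Inner product of two vectors given as sequences."""
    return sum(a * b for a, b in zip(u, v))


def excess_demand(p, a=0.4, w=(1.0, 1.0)):
    """Excess demand of one Cobb-Douglas consumer with endowment w."""
    wealth = dot(p, w)
    return [a * wealth / p[0] - w[0], (1.0 - a) * wealth / p[1] - w[1]]


def warp_holds(z, p, q, tol=1e-9):
    """Check the WARP implication for the ordered pair of prices (p, q)."""
    zp, zq = z(p), z(q)
    if max(abs(u - v) for u, v in zip(zp, zq)) <= tol:
        return True                      # identical choices: nothing to check
    if dot(p, zp) >= dot(p, zq) - tol:   # z(q) was affordable at p
        return dot(q, zp) > dot(q, zq)   # so z(p) must not be affordable at q
    return True                          # premise fails, implication holds


prices = [(1.0, r) for r in (0.25, 0.5, 1.0, 2.0, 4.0)]
violations = [(p, q) for p in prices for q in prices
              if not warp_holds(excess_demand, p, q)]
print(violations)
```

Replacing `excess_demand` by an aggregate over several consumers lets one search for violations of the aggregate WARP on a chosen price grid.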




Figure 7.15: Both individual demand functions satisfy WARP. 



Even if the individual demand functions satisfy WARP, the aggregate excess demand
function does not necessarily satisfy the WARP in aggregate.

Figure 7.16: The aggregate excess demand function does not satisfy WARP.

The WARP is a weaker restriction than the continuous concave utility function. However,
the restriction on the aggregate excess demand function is not as weak as it may
seem. Even though two individuals satisfy the individual WARP, the aggregate excess
demand function may not satisfy the aggregate WARP, as shown in Figures 7.15 and 7.16.

Lemma 7.5.1 Under the assumptions of strong Walras' Law and WARP, we have $p^* z(p) > 0$
for all $p \ne kp^*$ where $p^*$ is a competitive equilibrium.

Proof: Suppose $p^*$ is a competitive equilibrium:

$$z(p^*) \le 0. \qquad (7.50)$$

Also, by Walras' Law, $pz(p) = 0$. So we have $pz(p) \ge pz(p^*)$. Then, by WARP, $p^* z(p) >
p^* z(p^*) = 0 \Rightarrow p^* z(p) > 0$ for all $p \ne kp^*$.

Theorem 7.5.2 Under the assumptions of Walras' Law and WARP in aggregate excess
demand function, the competitive equilibrium is unique.

Proof: By Lemma 7.5.1, for any $p \ne kp^*$, $p^* z(p) > 0$, which means that $z^l(p) > 0$ for at least some $l$,
so $p$ cannot be a competitive equilibrium price vector.

7.6 Stability of Competitive Equilibrium

The concept of competitive equilibrium is a stationary concept. But it gives no
guarantee that the economy will actually operate at the equilibrium point. An economy
may also deviate from an equilibrium due to economic shocks. What forces exist that might
tend to move prices to a market-clearing price? This is the question of the stability of the
price adjustment mechanism in a competitive equilibrium.

A paradoxical relationship between the idea of competition and price adjustment is 
that: If all agents take prices as given, how can prices move? Who is left to adjust prices? 

To solve this paradox, one introduces a "Walrasian auctioneer" whose sole function is
to seek the market clearing prices. The Walrasian auctioneer is supposed to call the
prices and change the price mechanically in response to the aggregate excess demand until
the market clears. Such a process is called the Tatonnement adjustment process.

Tatonnement Adjustment Process is defined, according to the laws of demand
and supply, by

$$\frac{dp^l}{dt} = G^l(z^l(p)), \qquad l = 1, \ldots, L \qquad (7.51)$$

where $G^l$ is a sign-preserving function of $z^l(p)$, i.e., $G^l(x) > 0$ if $x > 0$, $G^l(x) = 0$ if
$x = 0$, and $G^l(x) < 0$ if $x < 0$. The above equation implies that when the aggregate
excess demand is positive, we have a shortage and thus the price should go up by the laws of
demand and supply.

As a special case, $G^l$ can be the identity mapping, so that

$$\dot{p}^l = z^l(p) \qquad (7.52)$$

$$\dot{p} = z(p). \qquad (7.53)$$

Under Walras' Law,

$$\frac{d}{dt}(p'p) = \frac{d}{dt}\left[\sum_{l=1}^{L} (p^l)^2\right] = 2 \sum_{l=1}^{L} p^l \frac{dp^l}{dt} = 2p'\dot{p} = 2pz(p) = 0$$

which means that the sum of squares of the prices remains constant as the price adjusts.
This is another price normalization. The path of the prices is restricted to the surface
of a sphere in $\mathbb{R}^L$.

Examples of price dynamics are shown in Figure 7.17. The first and third figures show a stable
equilibrium, the second and fourth figures show a unique unstable equilibrium. Formally,
we have

Definition 7.6.1 An equilibrium price $p^*$ is globally stable if

(i) $p^*$ is the unique competitive equilibrium,

(ii) for all $p_0$ there exists a unique price path $p = \phi(t, p_0)$ for $0 \le t < \infty$ such
that $\lim_{t \to \infty} \phi(t, p_0) = p^*$.

Definition 7.6.2 An equilibrium price $p^*$ is locally stable if there is $\delta > 0$ and a unique
price path $p = \phi(t, p_0)$ such that $\lim_{t \to \infty} \phi(t, p_0) = p^*$ whenever $|p^* - p_0| < \delta$.

Figure 7.17: The first and third figures show the CEs are stable, and the second and
fourth show they are not stable.
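The tatonnement process with the identity mapping, $\dot{p} = z(p)$, can be simulated for a small economy. The sketch below is only an illustration under assumptions: it uses the Cobb-Douglas exchange economy of Example 7.4.1 with absolute prices $(p^1, p^2)$, an explicit Euler discretization with a hypothetical step size, and illustrative parameters $a = 0.3$, $b = 0.6$. In continuous time the sum of squared prices is exactly constant; the Euler scheme only preserves it approximately.

```python
def excess_demand(p, a, b):
    """z(p) for the economy of Example 7.4.1 with absolute prices (p1, p2)."""
    p1, p2 = p
    z1 = a + b * p2 / p1 - 1.0            # good 1: a + b*p2/p1 - 1
    z2 = (1.0 - a) * p1 / p2 - b          # good 2: (1-a)*p1/p2 + (1-b) - 1
    return [z1, z2]


def tatonnement(p0, a, b, dt=0.01, steps=2000):
    """Euler steps p <- p + dt * z(p); returns the whole price path."""
    p = list(p0)
    path = [list(p)]
    for _ in range(steps):
        z = excess_demand(p, a, b)
        p = [p[0] + dt * z[0], p[1] + dt * z[1]]
        path.append(p)
    return path


a, b = 0.3, 0.6
path = tatonnement([1.0, 1.0], a, b)
p_end = path[-1]
# Relative price at the end of the path versus the equilibrium (1 - a)/b.
print(p_end[1] / p_end[0], (1.0 - a) / b)
# Sum of squared prices at the start and at the end of the path.
print(sum(x * x for x in path[0]), sum(x * x for x in p_end))
```

Starting the path from other strictly positive prices gives a feel for global stability in this example, where the goods are gross substitutes.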

The local stability of a competitive equilibrium can be easily obtained from the standard
result on the local stability of a differential equation.

Theorem 7.6.1 A competitive equilibrium price $p^*$ is locally stable if the Jacobian matrix
defined by

$$A = \left[\frac{\partial z^l(p^*)}{\partial p^k}\right]$$

has all negative characteristic roots.
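The eigenvalue condition can be checked numerically in a small example. The sketch below rests on several assumptions that are not in the text: a symmetric three-good Cobb-Douglas exchange economy in which consumer $i$ owns one unit of good $i$ and spends one third of his wealth on each good, good 3 as numeraire ($p^3 = 1$), and a central-difference approximation of the Jacobian. The normalization matters because homogeneity of degree zero implies that the full Jacobian satisfies $Dz(p^*) p^* = 0$ and therefore always has a zero characteristic root. The condition is read here as "all characteristic roots have negative real parts".

```python
import cmath


def excess_demand(p1, p2, p3=1.0):
    """(z^1, z^2) for the assumed symmetric three-good economy."""
    total = p1 + p2 + p3                  # aggregate wealth
    return [total / (3.0 * p1) - 1.0, total / (3.0 * p2) - 1.0]


def jacobian(p1, p2, h=1e-6):
    """Central-difference Jacobian of (z^1, z^2) with respect to (p^1, p^2)."""
    up1, dn1 = excess_demand(p1 + h, p2), excess_demand(p1 - h, p2)
    up2, dn2 = excess_demand(p1, p2 + h), excess_demand(p1, p2 - h)
    return [[(up1[r] - dn1[r]) / (2.0 * h), (up2[r] - dn2[r]) / (2.0 * h)]
            for r in range(2)]


def eigenvalues_2x2(m):
    """Characteristic roots of a 2x2 matrix via the quadratic formula."""
    trace = m[0][0] + m[1][1]
    det = m[0][0] * m[1][1] - m[0][1] * m[1][0]
    root = cmath.sqrt(trace * trace - 4.0 * det)
    return (trace + root) / 2.0, (trace - root) / 2.0


def is_locally_stable(m):
    """True if every characteristic root has a negative real part."""
    return all(ev.real < 0 for ev in eigenvalues_2x2(m))


A = jacobian(1.0, 1.0)                    # p* = (1, 1, 1) clears all markets
print(eigenvalues_2x2(A), is_locally_stable(A))
```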

The global stability result can be obtained by Liaponov's Theorem.

Liaponov's function: For a differential equation system $\dot{x} = f(x)$ with $f(x^*) = 0$,
a function $V$ is said to be a Liaponov's function for the differential equation system if

(1) there is a unique $x^*$ such that $V(x^*) = 0$;

(2) $V(x) > 0$ for all $x \ne x^*$;

(3) $\frac{dV(x)}{dt} < 0$.

Theorem 7.6.2 (Liaponov's Theorem) If there exists a Liaponov's function for $\dot{x} =
f(x)$, the unique stationary point $x^*$ is globally stable.

Debreu (1974) has shown essentially that any continuous function which satisfies
Walras' Law is an aggregate excess demand function for some economy. Thus, utility maximization
places no restriction on aggregate behavior. Hence, to get global stability, one has to
make some additional assumptions such as gross substitutability and the Weak Axiom of
Revealed Preference (WARP) for aggregate excess demand functions.

Lemma 7.6.1 Under the assumptions of Walras' Law and gross substitutability, we have
$p^* z(p) > 0$ for all $p \ne kp^*$ where $p^* > 0$ is a competitive equilibrium.

Proof. The proof of this lemma is complicated. We illustrate the proof only for the case 
of two commodities by aid of the figure. The general proof of this lemma can be seen in 
Arrow and Hahn, 1971, p. 224. 
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Figure 7.18: An illustration of the proof of Lemma 7.6.1. 

Let $p^*$ be an equilibrium price vector and $x^* = x(p^*)$ be the aggregate equilibrium demand. Let $p \neq \alpha p^*$ for any $\alpha > 0$. Then we know $p$ is not an equilibrium price vector by the uniqueness of the competitive equilibrium under gross substitutability. Since $p x(p) = p w = p x(p^*)$ by Walras' Law and $w = x(p^*)$, the aggregate demand $x(p)$ is on the line AB which passes through the point $x^*$ and whose slope is given by $p$. Let CD be the line which passes through the point $x^*$ and whose slope is given by $p^*$. We assume that $p^{*1}/p^{*2} > p^1/p^2$ without loss of generality. Then $p^{*1}/p^1 > p^{*2}/p^2 \equiv \mu$. Thus, $p^{*1} > \mu p^1$ and $p^{*2} = \mu p^2$. Therefore, we have $x^2(p^*) > x^2(\mu p)$ by gross substitutability. But Walras' Law implies that $\mu p\, x(\mu p) = \mu p\, x(p^*)$, so that we must have $x^1(\mu p) > x^1(p^*)$. Using the homogeneity assumption, we get $x^1(p) = x^1(\mu p) > x^1(p^*) = x^{*1}$ and $x^2(p) = x^2(\mu p) < x^2(p^*) = x^{*2}$. Hence the point $x(p)$ must lie to the right of the point $x^*$ in the figure. Now draw a line parallel to CD passing through the point $x(p)$. We see that $p^* x(p) > p^* x^*$ and thus $p^* z(p) > 0$. The proof is completed.

Now we are ready to prove the following theorem on the global stability of a compet- 
itive equilibrium. 

Theorem 7.6.3 (Arrow-Block-Hurwicz) Under the assumptions of Walras' Law, if $z(p)$ satisfies either gross substitutability or WARP, then the competitive equilibrium price is globally stable.

Proof: By the gross substitutability or WARP of the aggregate excess demand function, the competitive equilibrium price $p^*$ is unique. We now show it is globally stable. Define a Lyapunov's function by

$$V(p) = \sum_{l=1}^{L} (p^l(t) - p^{*l})^2 = (p - p^*) \cdot (p - p^*) \tag{7.54}$$

By the assumption, the competitive equilibrium price $p^*$ is unique. Also, since

$$\frac{dV}{dt} = 2 \sum_{l=1}^{L} (p^l(t) - p^{*l}) \frac{dp^l(t)}{dt}$$
$$= 2 \sum_{l=1}^{L} (p^l(t) - p^{*l}) z^l(p)$$
$$= 2 [p\, z(p) - p^* z(p)]$$
$$= -2 p^* z(p) < 0$$

by Walras' Law and Lemma 7.5.1 and 7.6.1 for $p \neq k p^*$, we know $\dot{p} = z(p)$ is globally stable by Lyapunov's theorem.
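The convergence argument can be illustrated numerically. The following is a minimal sketch, assuming a hypothetical two-good exchange economy with two Cobb-Douglas consumers (which satisfies gross substitutability) and a simple Euler discretisation of $\dot{p} = z(p)$. The expenditure shares, endowments, step size, and function names are illustrative assumptions, not part of the notes.

```python
# Illustrative tatonnement p' = z(p) for a two-good Cobb-Douglas exchange economy.
# All numbers below are assumptions chosen for the example.

SHARES = (0.3, 0.6)                      # expenditure share of good 1 for each consumer
ENDOWMENTS = ((1.0, 0.0), (0.0, 1.0))    # consumer 1 owns good 1, consumer 2 owns good 2


def excess_demand(p):
    """Aggregate excess demand z(p) = sum_i x_i(p) - sum_i w_i."""
    p1, p2 = p
    z1 = z2 = 0.0
    for share, (w1, w2) in zip(SHARES, ENDOWMENTS):
        income = p1 * w1 + p2 * w2
        z1 += share * income / p1 - w1
        z2 += (1.0 - share) * income / p2 - w2
    return (z1, z2)


def tatonnement(p0, step=0.01, n_steps=5000):
    """Euler discretisation of the price adjustment process p' = z(p)."""
    p = p0
    for _ in range(n_steps):
        z = excess_demand(p)
        p = (p[0] + step * z[0], p[1] + step * z[1])
    return p


P_STAR = (6.0, 7.0)      # equilibrium prices up to scale: p2 / p1 = 7 / 6
p_start = (2.0, 0.5)
p_end = tatonnement(p_start)

z_start = excess_demand(p_start)
print("Walras' Law, p.z(p):", p_start[0] * z_start[0] + p_start[1] * z_start[1])
print("Lemma 7.6.1, p*.z(p):", P_STAR[0] * z_start[0] + P_STAR[1] * z_start[1])
print("final price ratio p2/p1:", p_end[1] / p_end[0])
```

In this sketch the first printed value should be zero up to rounding (Walras' Law), the second should be positive (Lemma 7.6.1), and the final price ratio should approach the equilibrium ratio. The Euler scheme preserves $\|p(t)\|$ only approximately, so the sketch checks the price ratio rather than the price level.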

The above theorem establishes the global stability of a competitive equilibrium under 
Walras' Law, homogeneity, and gross substitutability/WARP. It is natural to ask how far 
we can relax these assumptions, in particular gross substitutability. Since many goods in 
reality are complementary goods, can the global stability of a competitive equilibrium also 
hold for complementary goods? Scarf (1961) has constructed examples that show that 
a competitive equilibrium may not be globally stable in the presence of complementary 
goods. 

Example 7.6.1 (Scarf's Counterexample on Global Instability) Consider a pure exchange economy with three consumers and three commodities ($n = 3$, $L = 3$). Suppose consumers' endowments are given by $w_1 = (1, 0, 0)$, $w_2 = (0, 1, 0)$, $w_3 = (0, 0, 1)$ and their utility functions are given by

$$u_1(x_1) = \min\{x_1^1, x_1^2\}$$
$$u_2(x_2) = \min\{x_2^2, x_2^3\}$$
$$u_3(x_3) = \min\{x_3^1, x_3^3\}$$

so that they have L-shaped indifference curves. Then, the aggregate excess demand function is given by

$$z^1(p) = -\frac{p^2}{p^1 + p^2} + \frac{p^3}{p^1 + p^3} \tag{7.55}$$

$$z^2(p) = -\frac{p^3}{p^2 + p^3} + \frac{p^1}{p^1 + p^2} \tag{7.56}$$

$$z^3(p) = -\frac{p^1}{p^1 + p^3} + \frac{p^2}{p^2 + p^3}, \tag{7.57}$$

from which the only possible competitive equilibrium price system is given by $p^{*1} = p^{*2} = p^{*3} = 1$.

Then, the dynamic adjustment equation is given by

$$\dot{p} = z(p).$$

We know that $\|p(t)\|$ is constant for all $t$ by Walras' Law. Now we want to show that $\prod_{l=1}^{3} p^l(t)$ is constant for all $t$. Indeed,

$$\frac{d}{dt} \left( \prod_{l=1}^{3} p^l(t) \right) = \dot{p}^1 p^2 p^3 + \dot{p}^2 p^1 p^3 + \dot{p}^3 p^1 p^2$$
$$= z^1 p^2 p^3 + z^2 p^1 p^3 + z^3 p^1 p^2 = 0.$$

Now, we show that the dynamic process is not globally stable. First choose the initial prices $p^l(0)$ such that $\sum_{l=1}^{3} [p^l(0)]^2 = 3$ and $\prod_{l=1}^{3} p^l(0) \neq 1$. Then, $\sum_{l=1}^{3} [p^l(t)]^2 = 3$ and $\prod_{l=1}^{3} p^l(t) \neq 1$ for all $t$. Since $\sum_{l=1}^{3} [p^l(t)]^2 = 3$ and the only possible equilibrium prices are $p^{*1} = p^{*2} = p^{*3} = 1$, the solution of the above system of differential equations cannot converge to the equilibrium price $p^* = (1, 1, 1)$.
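The two conserved quantities in Scarf's example can be checked numerically. The sketch below integrates $\dot{p} = z(p)$ with a hand-written fourth-order Runge-Kutta step. The integrator, step size, horizon, and initial prices are assumptions made for illustration only.

```python
# Illustrative integration of Scarf's price dynamics p' = z(p).
# Step size, horizon and starting prices are assumptions for this sketch.
import math


def scarf_excess_demand(p):
    """Excess demand (7.55)-(7.57) of Scarf's three-good economy."""
    p1, p2, p3 = p
    z1 = -p2 / (p1 + p2) + p3 / (p1 + p3)
    z2 = -p3 / (p2 + p3) + p1 / (p1 + p2)
    z3 = -p1 / (p1 + p3) + p2 / (p2 + p3)
    return (z1, z2, z3)


def rk4_step(p, h):
    """One classical Runge-Kutta step for p' = z(p)."""
    def shifted(k, scale):
        return tuple(pi + scale * ki for pi, ki in zip(p, k))

    k1 = scarf_excess_demand(p)
    k2 = scarf_excess_demand(shifted(k1, 0.5 * h))
    k3 = scarf_excess_demand(shifted(k2, 0.5 * h))
    k4 = scarf_excess_demand(shifted(k3, h))
    return tuple(
        pi + h * (a + 2.0 * b + 2.0 * c + d) / 6.0
        for pi, a, b, c, d in zip(p, k1, k2, k3, k4)
    )


def simulate(p0, h=0.01, n_steps=10000):
    """Return the list of prices visited along the discretised path."""
    path = [p0]
    for _ in range(n_steps):
        path.append(rk4_step(path[-1], h))
    return path


# Initial prices on the sphere sum (p^l)^2 = 3 whose product differs from 1.
p0 = (1.2, 1.0, math.sqrt(3.0 - 1.2 ** 2 - 1.0 ** 2))
path = simulate(p0)

sum_squares = [sum(pi ** 2 for pi in p) for p in path]
products = [p[0] * p[1] * p[2] for p in path]
distances = [math.sqrt(sum((pi - 1.0) ** 2 for pi in p)) for p in path]

print("range of sum of squares:", min(sum_squares), max(sum_squares))
print("range of product:", min(products), max(products))
print("closest approach to (1, 1, 1):", min(distances))
```

Along the simulated path both the sum of squares and the product of prices should stay at their initial values up to discretisation error, so the path keeps a positive distance from $p^* = (1, 1, 1)$.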

In this example, we may note the following facts: (i) there is no substitution effect, (ii) the indifference curve is not strictly convex, and (iii) the indifference curve has a kink and hence is not differentiable. Scarf (1961) also provided examples of instability in which the substitution effect is present for the case of Giffen goods. Thus, Scarf's examples indicate that instability may occur in a wide variety of cases.



7.7 Abstract Economy 

The abstract economy defined by Debreu (Econometrica, 1952) generalizes the notion of an N-person Nash noncooperative game in that a player's strategy set depends on the strategy choices of all the other players. It can be used to prove the existence of competitive equilibrium, since the market mechanism can be regarded as an abstract economy, as shown by Arrow and Debreu (Econometrica, 1954). Debreu (1952) proved the existence of equilibrium in abstract economies with finitely many agents and finite dimensional strategy spaces by assuming the existence of continuous utility functions. Since Debreu's seminal work on abstract economies, many existence results have been given in the literature. Shafer and Sonnenschein (J. of Mathematical Economics, 1975) extended Debreu's results to abstract economies without ordered preferences.

7.7.1 Equilibrium in Abstract Economy 

Let $N$ be the set of agents, which is any countable or uncountable set. Each agent $i$ chooses a strategy $x_i$ in a set $X_i$ of $\mathbb{R}^L$. Denote by $X$ the (Cartesian) product $\prod_{j \in N} X_j$ and by $X_{-i}$ the product $\prod_{j \in N \setminus \{i\}} X_j$. Denote by $x$ and $x_{-i}$ an element of $X$ and $X_{-i}$, respectively. Each agent $i$ has a payoff (utility) function $u_i : X \to \mathbb{R}$. Note that agent $i$'s utility depends not only on his own choice, but also on the choices of the others. Given $x_{-i}$ (the strategies of others), the choice of the $i$-th agent is restricted to a non-empty feasible choice set $F_i(x_{-i}) \subset X_i$, and the $i$-th agent chooses $x_i \in F_i(x_{-i})$ so as to maximize $u_i(x_{-i}, x_i)$ over $F_i(x_{-i})$.

An abstract economy (also called a generalized game) $\Gamma = (X_i, F_i, u_i)_{i \in N}$ is defined as a family of ordered triples $(X_i, F_i, u_i)$.

Definition 7.7.1 A vector $x^* \in X$ is said to be an equilibrium of an abstract economy if, for all $i \in N$,

(i) $x_i^* \in F_i(x_{-i}^*)$ and

(ii) $x_i^*$ maximizes $u_i(x_{-i}^*, x_i)$ over $F_i(x_{-i}^*)$.

If $F_i(x_{-i}) \equiv X_i$ for all $i \in N$, the abstract economy reduces to the conventional game $\Gamma = (X_i, u_i)$ and the equilibrium is called a Nash equilibrium.

Theorem 7.7.1 (Arrow-Debreu) Let $X$ be a non-empty compact convex subset of $\mathbb{R}^{nL}$. Suppose that

i) the correspondence $F : X \to 2^X$ is a continuous correspondence with non-empty compact and convex values,

ii) $u_i : X \times X \to \mathbb{R}$ is continuous,

iii) $u_i : X \times X \to \mathbb{R}$ is either quasi-concave in $x_i$ or it has a unique maximum on $F_i(x_{-i})$ for all $x_{-i} \in X_{-i}$.

Then $\Gamma$ has an equilibrium.

Proof. For each $i \in N$, define the maximizing correspondence

$$M_i(x_{-i}) = \{ x_i \in F_i(x_{-i}) : u_i(x_{-i}, x_i) \ge u_i(x_{-i}, z_i), \ \forall z_i \in F_i(x_{-i}) \}.$$

Then, the correspondence $M_i : X_{-i} \to 2^{X_i}$ is non-empty compact and convex valued because $u_i$ is continuous in $x$ and quasi-concave in $x_i$ and $F_i(x_{-i})$ is non-empty convex compact-valued. Also, by the Maximum Theorem, $M_i$ is an upper hemi-continuous correspondence. Therefore the correspondence

$$M(x) = \prod_{i \in N} M_i(x_{-i})$$

is an upper hemi-continuous correspondence with non-empty convex compact values. Thus, by Kakutani's Fixed Point Theorem, there exists $x^* \in X$ such that $x^* \in M(x^*)$, and $x^*$ is an equilibrium in the generalized game. Q.E.D.
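The fixed-point property $x^* \in M(x^*)$ can be seen in a small example. The sketch below assumes a hypothetical Cournot duopoly with linear inverse demand and constant marginal cost, in which the best-response map is single-valued and a contraction, so simple iteration happens to converge. Kakutani's theorem itself is non-constructive, and such iteration need not converge in general. All parameter values and function names are assumptions.

```python
# Illustrative fixed point of the best-response map in a Cournot duopoly.
# Inverse demand P(Q) = A - B*Q and constant marginal cost C are assumptions.

A, B, C = 10.0, 1.0, 1.0
Q_MAX = A / B            # strategy sets X_i = [0, Q_MAX] are compact and convex


def best_response(q_other):
    """M_i(x_-i): the unique maximiser of (A - B*(q + q_other) - C) * q on [0, Q_MAX]."""
    unconstrained = (A - C - B * q_other) / (2.0 * B)
    return min(max(unconstrained, 0.0), Q_MAX)


def joint_best_response(q):
    """M(x) = M_1(x_-1) x M_2(x_-2), single-valued in this example."""
    return (best_response(q[1]), best_response(q[0]))


def iterate_to_fixed_point(q0, n_iter=100):
    """Repeatedly apply the joint best-response map starting from q0."""
    q = q0
    for _ in range(n_iter):
        q = joint_best_response(q)
    return q


q_star = iterate_to_fixed_point((0.0, 0.0))
residual = [a - b for a, b in zip(q_star, joint_best_response(q_star))]
print("approximate equilibrium:", q_star)
print("fixed-point residual:", residual)
```

Here $F_i(x_{-i}) = X_i$ for both players, so the fixed point is a Nash equilibrium in the sense of Definition 7.7.1; with these parameters each firm's equilibrium output is $(A - C)/(3B)$.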

A competitive market mechanism can be regarded as an abstract economy. For simplicity, consider an exchange economy $e = (X_i, u_i, w_i)_{i \in N}$. Define an abstract economy $\Gamma = (Z_i, F_i, u_i)_{i \in N+1}$ as follows. Let

$$Z_i = X_i, \quad i = 1, \ldots, n$$
$$Z_{n+1} = S^{L-1}$$
$$F_i(x_i, p) = \{ x_i \in X_i : p x_i \le p w_i \}, \quad i = 1, \ldots, n$$
$$F_{n+1} = S^{L-1}$$
$$u_{n+1}(p, x) = \sum_{i=1}^{n} p (x_i - w_i) \tag{7.58}$$

Here agent $n+1$'s sole purpose is to change prices, and thus he can be regarded as a Walrasian auctioneer. Then, we verify that the economy $e$ has a competitive equilibrium if the abstract economy defined above has an equilibrium by noting that $\sum_{i=1}^{n} x_i \le \sum_{i=1}^{n} w_i$ at the equilibrium of the abstract economy.

7.7.2 The Existence of Equilibrium for General Preferences

The above theorem on the existence of an equilibrium in an abstract economy has assumed that the preference relation is an ordering and can be represented by a utility function. In this subsection, we consider the existence of equilibrium in an abstract economy where individuals' preferences $\succ_i$ may be non-total or non-transitive. Define agent $i$'s preference correspondence $P_i : X \to 2^{X_i}$ by

$$P_i(x) = \{ y_i \in X_i : (y_i, x_{-i}) \succ_i (x_i, x_{-i}) \}.$$

We call $\Gamma = (X_i, F_i, P_i)_{i \in N}$ an abstract economy (or a generalized game); it is defined as a family of ordered triples $(X_i, F_i, P_i)$. An equilibrium for $\Gamma$ is an $x^* \in X$ such that $x^* \in F(x^*)$ and $P_i(x^*) \cap F_i(x^*) = \emptyset$ for each $i \in N$.

Shafer and Sonnenschein (1975) proved the following theorem that generalizes the above theorem to an abstract economy with non-complete/non-transitive preferences.

Theorem 7.7.2 (Shafer-Sonnenschein) Let $\Gamma = (X_i, F_i, P_i)_{i \in N}$ be an abstract economy satisfying the following conditions for each $i \in N$:

(i) $X_i$ is a non-empty, compact, and convex subset in $\mathbb{R}^{l_i}$,

(ii) $F_i : X \to 2^{X_i}$ is a continuous correspondence with non-empty, compact, and convex values,

(iii) $P_i$ has open graph,

(iv) $x_i \notin \mathrm{con}\, P_i(x)$ for all $x \in X$.

Then $\Gamma$ has an equilibrium.

This theorem requires that the preferences have open graph. Tian (International Journal of Game Theory, 1992) proved the following theorem, which is more general and generalizes the results of Debreu (1952) and Shafer and Sonnenschein (1975) by relaxing the openness of graphs or lower sections of preference correspondences. Before proceeding to the theorem, we state some technical lemmas which are due to Michael (1956, Propositions 2.5, 2.6 and Theorem 3.1''').

Lemma 7.7.1 Let $X \subset \mathbb{R}^M$ and $Y \subset \mathbb{R}^K$ be two convex subsets and $\phi : X \to 2^Y$, $\psi : X \to 2^Y$ be correspondences such that

(i) $\phi$ is lower hemi-continuous, convex valued, and has open upper sections,

(ii) $\psi$ is lower hemi-continuous,

(iii) for all $x \in X$, $\phi(x) \cap \psi(x) \neq \emptyset$.

Then the correspondence $\theta : X \to 2^Y$ defined by $\theta(x) = \phi(x) \cap \psi(x)$ is lower hemi-continuous.

Lemma 7.7.2 Let $X \subset \mathbb{R}^M$ and $Y \subset \mathbb{R}^K$ be two convex subsets, and let $\phi : X \to 2^Y$ be lower hemi-continuous. Then the correspondence $\psi : X \to 2^Y$ defined by $\psi(x) = \mathrm{con}\, \phi(x)$ is lower hemi-continuous.

Lemma 7.7.3 Let $X \subset \mathbb{R}^M$ and $Y \subset \mathbb{R}^K$ be two convex subsets. Suppose $F : X \to 2^Y$ is a lower hemi-continuous correspondence with non-empty and convex values. Then there exists a continuous function $f : X \to Y$ such that $f(x) \in F(x)$ for all $x \in X$.

Theorem 7.7.3 (Tian) Let $\Gamma = (X_i, F_i, P_i)_{i \in N}$ be a generalized game satisfying for each $i \in N$:

(i) $X_i$ is a non-empty, compact, and convex subset in $\mathbb{R}^{l_i}$,

(ii) $F_i$ is a continuous correspondence, and $F_i(x)$ is non-empty, compact, and convex for all $x \in X$,

(iii) $P_i$ is lower hemi-continuous and has open upper sections,

(iv) $x_i \notin \mathrm{con}\, P_i(x)$ for all $x \in F$.

Then $\Gamma$ has an equilibrium.

Proof. For each $i \in N$, define a correspondence $A_i : X \to 2^{X_i}$ by $A_i(x) = F_i(x) \cap \mathrm{con}\, P_i(x)$. Let $U_i = \{ x \in X : A_i(x) \neq \emptyset \}$. Since $F_i$ and $P_i$ are lower hemi-continuous in $X$, so are they in $U_i$. Then, by Lemma 7.7.2, $\mathrm{con}\, P_i$ is lower hemi-continuous in $U_i$. Also, since $P_i$ has open upper sections in $X$, so does $\mathrm{con}\, P_i$ in $X$ and thus $\mathrm{con}\, P_i$ in $U_i$. Further, $F_i(x) \cap \mathrm{con}\, P_i(x) \neq \emptyset$ for all $x \in U_i$. Hence, by Lemma 7.7.1, the correspondence $A_i|U_i : U_i \to 2^{X_i}$ is lower hemi-continuous in $U_i$ and for all $x \in U_i$, $A_i(x)$ is non-empty and convex. Also $X_i$ is finite dimensional. Hence, by Lemma 7.7.3, there exists a continuous function $f_i : U_i \to X_i$ such that $f_i(x) \in A_i(x)$ for all $x \in U_i$. Note that $U_i$ is open since $A_i$ is lower hemi-continuous. Define a correspondence $G_i : X \to 2^{X_i}$ by

$$G_i(x) = \begin{cases} \{ f_i(x) \} & \text{if } x \in U_i \\ F_i(x) & \text{otherwise} \end{cases}$$

Then $G_i$ is upper hemi-continuous. Thus the correspondence $G : X \to 2^X$ defined by $G(x) = \prod_{i \in N} G_i(x)$ is upper hemi-continuous and for all $x \in X$, $G(x)$ is non-empty, closed, and convex. Hence, by Kakutani's Fixed Point Theorem, there exists a point $x^* \in X$ such that $x^* \in G(x^*)$. Note that for each $i \in N$, if $x^* \in U_i$, then $x_i^* = f_i(x^*) \in A_i(x^*) \subset \mathrm{con}\, P_i(x^*)$, a contradiction to (iv). Hence, $x^* \notin U_i$ and thus for all $i \in N$, $x_i^* \in F_i(x^*)$ and $F_i(x^*) \cap \mathrm{con}\, P_i(x^*) = \emptyset$, which implies $F_i(x^*) \cap P_i(x^*) = \emptyset$. Thus $\Gamma$ has an equilibrium. ■

Note that if a correspondence $P$ has open graph, then it has upper and lower open sections; and if a correspondence $P$ has lower open sections, then $P$ is lower hemi-continuous. Thus, the conditions of the above theorem are indeed weaker.

Chapter 8

Normative Theory of Equilibrium: 
Its Welfare Properties 

8.1 Introduction 

In the preceding chapter, we studied the conditions which would guarantee the existence, 
uniqueness, and stability of competitive equilibrium. In this chapter, we will study the 
welfare properties of competitive equilibrium. 

Economists are interested not only in describing the world of the market economy, but 
also in evaluating it. Does a competitive market do a "good" job in allocating resources? 
Adam Smith's "invisible hand" says that market economy is efficient. Then in what sense 
and under what conditions is a market efficient? The concept of efficiency is related to 
a concern with the well-being of those in the economy. The normative analysis can not 
only help us understand what advantage the market mechanism has, it can also help us 
to evaluate an economic system used in the real world, as well as help us understand why 
China and East European countries want to change their economic institutions. 

The notion of economic efficiency consists of three requirements:

(1) Exchange efficiency: goods are traded efficiently so that no further mutually beneficial trade can be obtained.

(2) Production efficiency: there is no waste of resources in producing goods.

(3) Product mix efficiency: the mix of goods produced by the economy reflects the preferences of those in the economy.

8.2 Pareto Efficiency of Allocation 

When economists talk about the efficiency of resource allocation, they mean Pareto efficiency. It provides a minimum criterion for the efficient use of resources.

Under the market institution, we want to know what is the relationship between 
a market equilibrium and Pareto efficiency. There are two basic questions: (1) If a 
market (not necessarily competitive) equilibrium exists, is it Pareto efficient? (2) Can any 
Pareto efficient allocation be obtained through the market mechanism by redistributing 
endowments? 

The concept of Pareto efficiency is not just to study the efficiency of a market economy, 
but it also can be used to study the efficiency of any economic system. First let us define 
the notion of Pareto improvement. 

Definition 8.2.1 (Pareto Improvement:) An allocation can have a Pareto improve- 
ment if it is feasible and there is another feasible allocation such that one person would 
be better off and all other persons are not worse off. 

Definition 8.2.2 (Pareto Efficiency, also called Pareto Optimality:) An allocation 
is Pareto efficient or Pareto optimal (in short, P.O) if there is no Pareto improvement, 
that is, if it is feasible and there is no other feasible allocation such that one person would 
be better off and all other persons are not worse off. 

More precisely, for exchange economies, a feasible allocation $x$ is P.O. if there is no other allocation $x'$ such that

(i) $\sum_{i=1}^{n} x_i' \le \sum_{i=1}^{n} w_i$

(ii) $x_i' \succsim_i x_i$ for all $i$ and $x_k' \succ_k x_k$ for some $k = 1, \ldots, n$.

For a production economy, $(x, y)$ is Pareto optimal if and only if:

(1) $\hat{x} \le \hat{y} + \hat{w}$

(2) there is no feasible allocation $(x', y')$ s.t.

$$x_i' \succsim_i x_i \quad \text{for all } i$$
$$x_k' \succ_k x_k \quad \text{for some } k$$


A weaker concept about economic efficiency is the so-called weak Pareto efficiency. 

Definition 8.2.3 (Weak Pareto Efficiency) An allocation is weakly Pareto efficient if 

it is feasible and there is no other feasible allocation such that all persons are better off. 

Remark 8.2.1 Some textbooks, such as Varian (1992), use weak Pareto optimality as the definition of Pareto optimality. Under which conditions are they equivalent? It is clear that Pareto efficiency implies weak Pareto efficiency. But the converse may not be true. However, under the continuity and strict monotonicity of preferences, the converse is true.

Proposition 8.2.1 Under the continuity and strict monotonicity of preferences, weak 
Pareto efficiency implies Pareto efficiency. 

Proof. Suppose $x$ is weakly Pareto efficient but not Pareto efficient. Then, there exists a feasible allocation $x'$ such that $x_i' \succsim_i x_i$ for all $i$ and $x_k' \succ_k x_k$ for some $k$.
Define $\bar{x}$ as follows:

$$\bar{x}_k = (1 - \theta) x_k'$$
$$\bar{x}_i = x_i' + \frac{\theta}{n - 1} x_k' \quad \text{for } i \neq k$$

Then we have

$$\bar{x}_k + \sum_{i \neq k} \bar{x}_i = (1 - \theta) x_k' + \sum_{i \neq k} \left( x_i' + \frac{\theta}{n - 1} x_k' \right) = \sum_{i=1}^{n} x_i' \tag{8.1}$$

which means $\bar{x}$ is feasible. Furthermore, by the continuity of preferences, $\bar{x}_k = (1 - \theta) x_k' \succ_k x_k$ when $\theta$ is sufficiently close to zero, and $\bar{x}_i \succ_i x_i$ for all $i \neq k$ by the strict monotonicity of preferences. This contradicts the fact that $x$ is weakly Pareto optimal. Thus, every weakly Pareto efficient allocation is Pareto efficient under the strict monotonicity and continuity of preferences.

Remark 8.2.2 The trick of taking a little from one person and then distributing it equally to the others, so that everyone is made better off, is a useful one. We will use the same trick to prove the second theorem of welfare economics.

Remark 8.2.3 The above proposition also depends on an implicit assumption that the goods under consideration are all private goods. Tian (Economics Letters, 1988) showed by example that, if goods are public goods, we may not have such equivalence between Pareto efficiency and weak Pareto efficiency under the monotonicity and continuity of preferences.

The set of Pareto efficient allocations can be shown with the Edgeworth Box. Every point in the Edgeworth Box is attainable. A is the starting point. Is "A" weakly Pareto optimal? No. The point C in the shaded area, for example, makes both persons better off. Is the point "B" Pareto optimal? Yes. Actually, all tangent points are Pareto efficient points, where at least one agent has no incentive to trade. The locus of all Pareto efficient points is called the contract curve.




Figure 8.1: The set of Pareto efficient allocations is given by the contract curve.

Remark 8.2.4 Equity and Pareto efficiency are two different concepts. Points like $O_A$ or $O_B$ are Pareto optimal points, but they are extremely unequal. To make an allocation relatively equitable and also efficient, the government needs to implement some institutional arrangements, such as taxes and subsidies, to balance equity and efficiency, but this is a value judgement and policy issue.

We will show that, when Pareto efficient points are given by tangent points of two persons' indifference curves, they should satisfy the following conditions:

$$MRS^A_{x^1 x^2} = MRS^B_{x^1 x^2}$$
$$x_A + x_B = \hat{w}.$$

When indifference curves of two agents are never tangent, we have the following cases. 

Case 1. For linear indifference curves with non-zero (negative) slope, indifference curves of agents may not be tangent. How can we find the set of Pareto efficient allocations in this case? We can do it by comparing the steepness of indifference curves:

$$MRS^A_{x^1 x^2} < MRS^B_{x^1 x^2} \tag{8.2}$$

Figure 8.2: The set of Pareto efficient allocations is given by the upper and left edges of the box when indifference curves are linear and Person B's indifference curves are steeper.

In this case, when an indifference curve for B is given, say, by Line AA, then K is a Pareto efficient point. When an indifference curve for A is given, say, by Line BB, then P is a Pareto efficient point. The contract curve is then given by the upper and left edges of the box. As a general method of finding all possible Pareto efficient points, whether or not indifference curves have tangent points, take any one of one person's indifference curves as given and find the other person's utility maximizing point on it. The maximizing point must be a Pareto efficient point.
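The method just described can be sketched on a discretised Edgeworth box. The example below assumes hypothetical linear utilities $u_A = x^1 + x^2$ and $u_B = 2x^1 + x^2$, so that B's indifference curves are steeper, and a total endowment of `N` indivisible units of each good. It fixes a utility floor for B, maximises A's utility, and breaks ties in B's favour so that the selected point is fully (not merely weakly) efficient on the grid. All names and numbers are assumptions.

```python
# Illustrative search for Pareto efficient points with linear utilities.
# The grid size N and both utility functions are assumptions for this sketch.
from itertools import product

N = 6  # total endowment of each good, in indivisible units


def u_a(x1, x2):
    return x1 + x2          # MRS_A = 1


def u_b(x1, x2):
    return 2 * x1 + x2      # MRS_B = 2, steeper indifference curves


def efficient_point(min_u_b):
    """Maximise A's utility subject to u_B >= min_u_b; ties go to B."""
    feasible = [
        (a1, a2)
        for a1, a2 in product(range(N + 1), repeat=2)
        if u_b(N - a1, N - a2) >= min_u_b
    ]
    return max(feasible, key=lambda a: (u_a(*a), u_b(N - a[0], N - a[1])))


# Trace out the contract curve by varying B's required utility level.
contract_curve = sorted({efficient_point(level) for level in range(3 * N + 1)})
print(contract_curve)
```

Each entry is A's bundle $(x_A^1, x_A^2)$; B receives the remainder. Under these assumptions every selected point has either $x_A^1 = 0$ (left edge) or $x_A^2 = N$ (upper edge), in line with Figure 8.2.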
Case 2. Suppose that utility functions are given by

$$u_A(x_A) = x_A^2$$

and

$$u_B(x_B) = x_B^1$$

Figure 8.3: The only Pareto efficient allocation point is given by the upper and left corner of the box when individuals only care about one commodity.

Then, the only Pareto efficient allocation is the upper left corner point. But the set of weakly Pareto efficient allocations is given by the upper and left edges of the box. Notice that utility functions in this example are continuous and monotonic, but a weakly Pareto efficient allocation may not be Pareto efficient. This example shows that strict monotonicity cannot be replaced by monotonicity for the equivalence of Pareto efficiency and weak Pareto efficiency.
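The gap between the two efficiency notions in this case can be checked by brute force on a grid. The sketch below assumes that person A cares only about good 2 and person B only about good 1 (the reading consistent with the upper left corner being the unique Pareto efficient point), with `N` indivisible units of each good. The grid size and function names are assumptions.

```python
# Illustrative enumeration of Pareto efficient and weakly Pareto efficient
# allocations when each person cares about one good only (assumed utilities).
from itertools import product

N = 4  # total endowment of each good, in indivisible units
allocations = list(product(range(N + 1), repeat=2))  # A's bundle (a1, a2); B gets the rest


def utilities(a):
    """Return (u_A, u_B) with u_A = x_A^2 and u_B = x_B^1."""
    a1, a2 = a
    return (a2, N - a1)


def is_pareto_efficient(a):
    ua, ub = utilities(a)
    return not any(
        va >= ua and vb >= ub and (va > ua or vb > ub)
        for va, vb in map(utilities, allocations)
    )


def is_weakly_pareto_efficient(a):
    ua, ub = utilities(a)
    return not any(va > ua and vb > ub for va, vb in map(utilities, allocations))


pareto = [a for a in allocations if is_pareto_efficient(a)]
weak = [a for a in allocations if is_weakly_pareto_efficient(a)]
print("Pareto efficient:", pareto)
print("weakly Pareto efficient:", weak)
```

Under these assumptions the first list contains only the corner where A holds all of good 2 and none of good 1, while the second list contains every allocation on the upper and left edges of the box.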
Case 3. Now suppose that the goods are perfect complements. Then, utility functions are given by

$$u_A(x_A) = \min\{a x_A^1, b x_A^2\}$$

and

$$u_B(x_B) = \min\{c x_B^1, d x_B^2\}$$

A special case is the one where $a = b = c = d = 1$.

Then, the set of Pareto optimal allocations is the area given by points between two 45° lines.

Case 4. One person's indifference curves are "thick." In this case, a weakly Pareto efficient allocation may not be Pareto efficient.




Figure 8.4: The first figure shows that the contract curve may be "thick" when the goods are perfect complements. The second figure shows that a weakly Pareto efficient allocation may not be Pareto efficient when indifference curves are "thick."

8.3 The First Fundamental Theorem of Welfare Economics



There is a well-known theorem, in fact one of the most important theorems in economics, which characterizes a desirable property of the competitive market institution. It claims that every competitive equilibrium allocation is Pareto efficient. A remarkable feature of this theorem is that it requires few assumptions, far fewer than those for the existence of competitive equilibrium. Some implicit assumptions in this section are that preferences are orderings, information is complete, goods are divisible, and there are no public goods or externalities.

Theorem 8.3.1 (The First Fundamental Theorem of Welfare Economics) If $(x, y, p)$ is a competitive equilibrium, then $(x, y)$ is weakly Pareto efficient, and further under local non-satiation, it is Pareto efficient.

Proof: Suppose $(x, y)$ is not weakly Pareto optimal. Then there exists another feasible allocation $(x', y')$ such that $x_i' \succ_i x_i$ for all $i$. Thus, we must have $p x_i' > p w_i + \sum_{j=1}^{J} \theta_{ij} p y_j$ for all $i$. Therefore, by summation, we have

$$\sum_{i=1}^{n} p x_i' > \sum_{i=1}^{n} p w_i + \sum_{j=1}^{J} p y_j.$$

Since $p y_j \ge p y_j'$ for all $y_j' \in Y_j$ by profit maximization, we have

$$\sum_{i=1}^{n} p x_i' > \sum_{i=1}^{n} p w_i + \sum_{j=1}^{J} p y_j' \tag{8.3}$$

or

$$p \left[ \sum_{i=1}^{n} x_i' - \sum_{i=1}^{n} w_i - \sum_{j=1}^{J} y_j' \right] > 0, \tag{8.4}$$

which contradicts the fact that $(x', y')$ is feasible.

To show Pareto efficiency, suppose $(x, y)$ is not Pareto optimal. Then there exists another feasible allocation $(x', y')$ such that $x_i' \succsim_i x_i$ for all $i$ and $x_k' \succ_k x_k$ for some $k$. Thus we have, by local non-satiation (otherwise a bundle close to $x_i'$ would be affordable and strictly preferred to $x_i$, contradicting utility maximization),

$$p x_i' \ge p w_i + \sum_{j=1}^{J} \theta_{ij} p y_j \quad \forall i$$

and by $x_k' \succ_k x_k$,

$$p x_k' > p w_k + \sum_{j=1}^{J} \theta_{kj} p y_j$$

and thus

$$\sum_{i=1}^{n} p x_i' > \sum_{i=1}^{n} p w_i + \sum_{j=1}^{J} p y_j \ge \sum_{i=1}^{n} p w_i + \sum_{j=1}^{J} p y_j'. \tag{8.5}$$

Again, it contradicts the fact that $(x', y')$ is feasible. ■
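The theorem can be sanity-checked on a small example. The sketch below assumes a hypothetical two-person, two-good exchange economy with Cobb-Douglas (log) utilities, computes its competitive equilibrium in closed form with good 2 as numeraire, and then searches a grid of feasible reallocations for one that Pareto-dominates a given allocation. The shares, endowments, grid step, and tolerance are assumptions, and a grid search is only a numerical check, not a proof.

```python
# Illustrative check that a competitive equilibrium allocation is not
# Pareto-dominated on a grid. All parameters are assumptions for this sketch.
import math

SHARES = {"A": 0.3, "B": 0.6}                      # Cobb-Douglas weight on good 1
ENDOWMENTS = {"A": (1.0, 0.0), "B": (0.0, 1.0)}
TOL = 1e-9                                         # guards against rounding noise


def utility(name, bundle):
    a = SHARES[name]
    return a * math.log(bundle[0]) + (1.0 - a) * math.log(bundle[1])


def competitive_equilibrium():
    """Return (p1, allocation) with good 2 as numeraire (p2 = 1)."""
    p1 = (sum(SHARES[i] * ENDOWMENTS[i][1] for i in SHARES)
          / sum((1.0 - SHARES[i]) * ENDOWMENTS[i][0] for i in SHARES))
    allocation = {}
    for i in SHARES:
        income = p1 * ENDOWMENTS[i][0] + ENDOWMENTS[i][1]
        allocation[i] = (SHARES[i] * income / p1, (1.0 - SHARES[i]) * income)
    return p1, allocation


def pareto_dominated_on_grid(allocation, n=20):
    """Search interior grid reallocations for one that Pareto-dominates `allocation`."""
    base_a = utility("A", allocation["A"])
    base_b = utility("B", allocation["B"])
    totals = [allocation["A"][l] + allocation["B"][l] for l in range(2)]
    for i1 in range(1, n):
        for i2 in range(1, n):
            xa = (totals[0] * i1 / n, totals[1] * i2 / n)
            xb = (totals[0] - xa[0], totals[1] - xa[1])
            ua, ub = utility("A", xa), utility("B", xb)
            no_one_worse = ua >= base_a - TOL and ub >= base_b - TOL
            someone_better = ua > base_a + TOL or ub > base_b + TOL
            if no_one_worse and someone_better:
                return True
    return False


p1, ce_allocation = competitive_equilibrium()
equal_split = {"A": (0.5, 0.5), "B": (0.5, 0.5)}
print("equilibrium price of good 1:", p1)
print("CE allocation dominated on grid:", pareto_dominated_on_grid(ce_allocation))
print("equal split dominated on grid:", pareto_dominated_on_grid(equal_split))
```

In this example the equilibrium allocation survives the search, whereas the equal split, which is feasible but not an equilibrium allocation here, is dominated by a reallocation in which A trades good 1 for good 2.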

Remark 8.3.1 If the local non-satiation condition is not satisfied, a competitive equilibrium allocation $x$ may not be Pareto optimal, say, for the case of thick indifference curves.

Figure 8.5: A CE allocation may not be Pareto efficient when the local non-satiation condition is not satisfied.

Remark 8.3.2 Note that neither convexity of preferences nor convexity of production sets is assumed in the theorem. The conditions required for Pareto efficiency of competitive equilibrium are much weaker than the conditions for the existence of competitive equilibrium.

Remark 8.3.3 If goods are indivisible, then a competitive equilibrium allocation $x$ may not be Pareto optimal.

Figure 8.6: A CE allocation may not be Pareto efficient when goods are indivisible.

The point $x$ is a competitive equilibrium allocation, but it is not Pareto optimal since $x'$ is preferred by person 1. Hence, the divisibility condition cannot be dropped for the First Fundamental Theorem of Welfare Economics to be true.

8.4 Calculations of Pareto Optimum by First-Order 
Conditions 

The first-order conditions for Pareto efficiency are a bit harder to formulate. However, the following trick is very useful.

8.4.1 Exchange Economies

Proposition 8.4.1 A feasible allocation $x^*$ is Pareto efficient if and only if $x^*$ solves the following problem for all $i = 1, 2, \ldots, n$:

$$\max_{x} \ u_i(x_i)$$
$$\text{s.t.} \quad \sum_{k=1}^{n} x_k \le \sum_{k=1}^{n} w_k$$
$$u_k(x_k) \ge u_k(x_k^*) \quad \text{for } k \neq i$$

Proof. Suppose $x^*$ solves all maximizing problems but $x^*$ is not Pareto efficient. This means that there is some allocation $x'$ where one consumer is better off, and the others are not worse off. But then $x^*$ does not solve one of the maximizing problems, a contradiction. Conversely, suppose $x^*$ is Pareto efficient, but it does not solve one of the problems. Instead, let $x'$ solve that particular problem. Then $x'$ makes one of the agents better off without hurting any of the other agents, which contradicts the assumption that $x^*$ is Pareto efficient. The proof is completed. ■

If utility functions $u_i(x_i)$ are differentiable, then we can define the Lagrangian function to get the optimal solution to the above problem:

$$L = u_i(x_i) + q (\hat{w} - \hat{x}) + \sum_{k \neq i} t_k [u_k(x_k) - u_k(x_k^*)]$$

The first order conditions are then given by

$$\frac{\partial L}{\partial x_i^l} = \frac{\partial u_i(x_i)}{\partial x_i^l} - q^l \le 0 \quad \text{with equality if } x_i^l > 0, \quad l = 1, \ldots, L, \ i = 1, \ldots, n, \tag{8.6}$$

$$\frac{\partial L}{\partial x_k^l} = t_k \frac{\partial u_k(x_k)}{\partial x_k^l} - q^l \le 0 \quad \text{with equality if } x_k^l > 0, \quad l = 1, \ldots, L; \ k \neq i. \tag{8.7}$$

By (8.6), when $x^*$ is an interior solution, we have

$$\frac{\partial u_i(x_i) / \partial x_i^l}{\partial u_i(x_i) / \partial x_i^h} = \frac{q^l}{q^h} = MRS_{x_i^l, x_i^h} \tag{8.8}$$

By (8.7)

$$\frac{\partial u_k(x_k) / \partial x_k^l}{\partial u_k(x_k) / \partial x_k^h} = \frac{q^l}{q^h} \tag{8.9}$$

Thus, we have

$$MRS_{x_1^l, x_1^h} = \cdots = MRS_{x_n^l, x_n^h}, \quad l = 1, 2, \ldots, L; \ h = 1, 2, \ldots, L, \tag{8.10}$$

which are the necessary conditions for the interior solutions to be Pareto efficient. They mean that the MRS of any two goods is the same for all agents. They become sufficient conditions when utility functions $u_i(x_i)$ are differentiable and quasi-concave.
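As a worked illustration of the equal-MRS condition, consider a hypothetical two-person, two-good exchange economy. The utility functions below and the symbols $\hat{w}^1, \hat{w}^2$ for the total endowments of the two goods are assumptions made for this example only.

```latex
\begin{align*}
u_A(x_A) &= x_A^1 x_A^2, \qquad u_B(x_B) = x_B^1 x_B^2, \qquad x_A + x_B = (\hat{w}^1, \hat{w}^2), \\
MRS^A_{x^1, x^2} &= \frac{\partial u_A / \partial x_A^1}{\partial u_A / \partial x_A^2} = \frac{x_A^2}{x_A^1}, \qquad
MRS^B_{x^1, x^2} = \frac{x_B^2}{x_B^1} = \frac{\hat{w}^2 - x_A^2}{\hat{w}^1 - x_A^1}, \\
\frac{x_A^2}{x_A^1} &= \frac{\hat{w}^2 - x_A^2}{\hat{w}^1 - x_A^1}
\;\Longrightarrow\; x_A^2 \hat{w}^1 = x_A^1 \hat{w}^2
\;\Longrightarrow\; x_A^2 = \frac{\hat{w}^2}{\hat{w}^1} \, x_A^1 .
\end{align*}
```

Under these assumptions the interior Pareto efficient allocations lie on the diagonal of the Edgeworth box, so the contract curve is the straight line joining the two origins.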



8.4.2 Production Economies 

For simplicity, we assume there is only one firm. Let $T(y) = 0$ be the transformation frontier. Similarly, we can prove the following proposition.

Proposition 8.4.2 A feasible allocation $(x^*, y^*)$ is Pareto efficient if and only if $(x^*, y^*)$ solves the following problem for all $i = 1, 2, \ldots, n$:

$$\max_{x} \ u_i(x_i)$$
$$\text{s.t.} \quad \sum_{k \in N} x_k = \sum_{k \in N} w_k + y$$
$$u_k(x_k) \ge u_k(x_k^*) \quad \text{for } k \neq i$$
$$T(y) = 0.$$

If utility functions $u_i(x_i)$ are differentiable, then we can define the Lagrangian function to get the first order conditions:

$$L = u_i(x_i) - \lambda T(\hat{x} - \hat{w}) + \sum_{k \neq i} t_k [u_k(x_k) - u_k(x_k^*)]$$

FOC:

$$\frac{\partial L}{\partial x_i^l} = \frac{\partial u_i(x_i)}{\partial x_i^l} - \lambda \frac{\partial T(y)}{\partial y^l} \le 0 \quad \text{with equality if } x_i^l > 0, \quad l = 1, \ldots, L, \ i = 1, \ldots, n, \tag{8.11}$$

$$\frac{\partial L}{\partial x_k^l} = t_k \frac{\partial u_k(x_k)}{\partial x_k^l} - \lambda \frac{\partial T(y)}{\partial y^l} \le 0 \quad \text{with equality if } x_k^l > 0, \quad l = 1, 2, \ldots, L; \ k \neq i. \tag{8.12}$$

When $x^*$ is an interior solution, we have by (8.11)

$$\frac{\partial u_i(x_i) / \partial x_i^l}{\partial u_i(x_i) / \partial x_i^h} = \frac{\partial T(y) / \partial y^l}{\partial T(y) / \partial y^h}, \tag{8.13}$$

and by (8.12)

$$\frac{\partial u_k(x_k) / \partial x_k^l}{\partial u_k(x_k) / \partial x_k^h} = \frac{\partial T(y) / \partial y^l}{\partial T(y) / \partial y^h}. \tag{8.14}$$

Thus, we have

$$MRS_{x_1^l, x_1^h} = \cdots = MRS_{x_n^l, x_n^h} = MRTS_{y^l, y^h}, \quad l = 1, 2, \ldots, L; \ h = 1, 2, \ldots, L, \tag{8.15}$$

which are the necessary conditions for the interior solutions to be Pareto efficient. They mean that the MRS of any two goods for all agents equals the MRTS. They become sufficient conditions when utility functions $u_i(x_i)$ are differentiable and quasi-concave and the production functions are concave.

8.5 The Second Fundamental Theorem of Welfare Economics

Now we can assert a converse of the First Fundamental Theorem of Welfare Economics. The Second Fundamental Theorem of Welfare Economics gives conditions under which a Pareto optimal allocation can be "supported" by a competitive equilibrium if we allow some redistribution of endowments. It tells us that, under some regularity assumptions, including the essential condition of convexity of preferences and production sets, any desired Pareto optimal allocation can be achieved as a market-based equilibrium with transfers.

We first define the competitive equilibrium with transfer payments (also called an 
equilibrium relative to a price system), that is, a competitive equilibrium is established 
after transferring some of initial endowments between agents. 

Definition 8.5.1 (Competitive Equilibrium with Transfer Payments) For an economy $e = (e_1, \ldots, e_n, \{Y_j\})$, $(x, y, p) \in X \times Y \times \mathbb{R}^L_+$ is a competitive equilibrium with transfer payments if

(i) $x_i \succsim_i x_i'$ for all $x_i' \in \{ x_i' \in X_i : p x_i' \le p x_i \}$ for $i = 1, \ldots, n$.

(ii) $p y_j \ge p y_j'$ for $y_j' \in Y_j$.

(iii) $\hat{x} \le \hat{w} + \hat{y}$ (feasibility condition).

Remark 8.5.1 An equilibrium with transfer payments is different from a competitive equilibrium with respect to a budget constrained utility maximization that uses a value calculated from the initial endowment. An equilibrium with transfer payments is defined without reference to the distribution of initial endowment, given the total amount.

Remark 8.5.2 If Condition (i) above is replaced by

(i') $x_i' \succ_i x_i$ implies that $p x_i' \ge p x_i$ for $i = 1, \ldots, n$,

$(x, y, p) \in X \times Y \times \mathbb{R}^L_+$ is called a price quasi-equilibrium with transfer payments. It may be remarked that, if $\hat{w} > 0$ and preferences are continuous and locally non-satiated, then any price quasi-equilibrium with transfers in which $x_i > 0$ is a price equilibrium with transfers. Furthermore, when preferences are strictly monotone, we must have $p > 0$ in any price quasi-equilibrium with transfers. For the proofs, see Mas-Colell, Whinston, and Green (1995).

The following theorem which is called the Second Fundamental Theorem of Welfare 
Economics shows that every Pareto efficient allocation can be supported by a competitive 
equilibrium through a redistribution of endowments so that one does not need to seek 
any alternative economic institution to reach Pareto efficient allocations. This is one of 
the most important theorems in modern economics, and the theorem is also one of the 
theorems in microeconomic theory whose proof is complicated. 

Theorem 8.5.1 (The Second Fundamental Theorem of Welfare Economics) Suppose
$(x^*, y^*)$ with $x^* > 0$ is Pareto optimal, suppose $\succsim_i$ are continuous, convex and strictly
monotonic, and suppose that $Y_j$ are closed and convex. Then, there is a price vector
$p \geq 0$ such that $(x^*, y^*, p)$ is a competitive equilibrium with transfer payments, i.e.,

(1) if $x_i' \succ_i x_i^*$, then $p x_i' > p x_i^*$ for $i = 1, \ldots, n$.

(2) $p y_j^* \geq p y_j'$ for all $y_j' \in Y_j$ and $j = 1, \ldots, J$.



Proof: Let

$P(x_i^*) = \{x_i \in X_i : x_i \succ_i x_i^*\}$ (8.16)

be the strict upper contour set and let

$P(x^*) = \sum_{i=1}^n P(x_i^*).$ (8.17)

By the convexity of $\succsim_i$, we know that $P(x_i^*)$ and thus $P(x^*)$ are convex.

Figure 8.7: $P(x_i^*)$ is the set of all points strictly above the indifference curve through $x_i^*$.

Let $W = \{\hat{w}\} + \sum_{j=1}^J Y_j$, which is closed and convex. Then $W \cap P(x^*) = \emptyset$ by Pareto
optimality of $(x^*, y^*)$, and thus, by the Separating Hyperplane Theorem in Chapter 1,
there is a $p \neq 0$ such that

$p \hat{z} \geq p \hat{\sigma}$ for all $\hat{z} \in P(x^*)$ and $\hat{\sigma} \in W$. (8.18)

Now we show that $p$ is a competitive equilibrium price vector by the following four steps.

1. $p \geq 0$.

To see this, let $e^l = (0, \ldots, 1, 0, \ldots, 0)$ with the $l$-th component one and other places
zero. Let

$\hat{z} = \hat{\sigma} + e^l$ for some $\hat{\sigma} \in W$.

Then $\hat{z} = \hat{\sigma} + e^l \in P(x^*)$ by strict monotonicity and redistribution of $e^l$. Thus, we
have by (8.18)

$p(\hat{\sigma} + e^l) \geq p \hat{\sigma}$ (8.19)

and thus

$p e^l \geq 0$ (8.20)

which means

$p^l \geq 0$ for $l = 1, 2, \ldots, L$. (8.21)






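2. $p y_j^* \geq p y_j$ for all $y_j \in Y_j$, $j = 1, \ldots, J$.

Since $\hat{x}^* = \hat{y}^* + \hat{w}$ by noting $(x^*, y^*)$ is a Pareto efficient allocation and preference
orderings are strictly monotonic, we have $p \hat{x}^* = p(\hat{w} + \hat{y}^*)$. Thus, by $p \hat{z} \geq p(\hat{w} + \hat{y})$
in (8.18) and $p \hat{w} = p \hat{x}^* - p \hat{y}^*$, we have

$p(\hat{z} - \hat{x}^*) \geq p(\hat{y} - \hat{y}^*) \quad \forall \hat{y} \in \hat{Y}.$

Letting $\hat{z} \to \hat{x}^*$, we have

$p(\hat{y} - \hat{y}^*) \leq 0 \quad \forall \hat{y} \in \hat{Y}.$

Letting $y_k = y_k^*$ for $k \neq j$, we have from the above inequality,

$p y_j^* \geq p y_j \quad \forall y_j \in Y_j.$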



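3. If $x_i \succ_i x_i^*$, then

$p x_i \geq p x_i^*.$ (8.22)

To see this, let

$x_i' = (1 - \theta) x_i, \quad 0 < \theta < 1,$

$x_k' = x_k^* + \frac{\theta}{n-1} x_i$ for $k \neq i$.

Then, by the continuity of $\succsim_i$ and the strict monotonicity of $\succsim_k$, we have $x_i' \succ_i x_i^*$
and $x_k' \succ_k x_k^*$ for all $k \neq i$, and thus

$x' \in P(x^*)$ (8.23)

if $\theta$ is sufficiently small. By (8.18), we have

$p\left(x_i' + \sum_{k \neq i} x_k'\right) = p\left[(1 - \theta) x_i + \sum_{k \neq i}\left(x_k^* + \frac{\theta}{n-1} x_i\right)\right] \geq p \sum_{k=1}^n x_k^*$ (8.24)

and thus we have

$p x_i \geq p x_i^*.$ (8.25)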

4. If $x_i \succ_i x_i^*$, we must have $p x_i > p x_i^*$. To show this, suppose by way of contradiction
that

$p x_i = p x_i^*.$ (8.26)

Since $x_i \succ_i x_i^*$, then $\lambda x_i \succ_i x_i^*$ for $\lambda$ sufficiently close to one by the continuity of
preferences for $0 < \lambda < 1$. By step 3, we know $\lambda p x_i \geq p x_i^* = p x_i$ so that $\lambda \geq 1$ by
$p x_i = p x_i^* > 0$, which contradicts the fact that $\lambda < 1$.
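
The statement of the theorem can be illustrated numerically. The sketch below is not part of the proof: it assumes a hypothetical two-consumer, two-good exchange economy with log Cobb-Douglas utilities $u_i(x) = a_i \ln x^1 + (1 - a_i) \ln x^2$ (the function names and parameter values are illustrative choices). It picks an interior Pareto optimal allocation, uses the common marginal rate of substitution as the price of good 1 (good 2 is the numeraire), and checks that each consumer demands exactly $x_i^*$ when given wealth $p x_i^*$.

```python
import math

def mrs(a, x):
    """MRS of good 1 for good 2 for u(x) = a*ln(x1) + (1 - a)*ln(x2)."""
    return a * x[1] / ((1.0 - a) * x[0])

def pareto_optimal_allocation(a_A, a_B, total, xA1):
    """Interior Pareto optimal allocation (equal MRS) where A gets xA1 of good 1."""
    w1, w2 = total
    kA = a_A / (1.0 - a_A)
    kB = a_B / (1.0 - a_B)
    # Solve kA*xA2/xA1 = kB*(w2 - xA2)/(w1 - xA1) for xA2.
    xA2 = kB * w2 * xA1 / (kA * (w1 - xA1) + kB * xA1)
    return (xA1, xA2), (w1 - xA1, w2 - xA2)

def demand(a, p, wealth):
    """Utility-maximizing bundle of the Cobb-Douglas consumer at prices p."""
    return (a * wealth / p[0], (1.0 - a) * wealth / p[1])

def supports(a_A, a_B, x_A, x_B):
    """Return (p, ok): p = (MRS, 1), and ok is True if each consumer's demand
    at wealth p.x_i* coincides with x_i*."""
    p = (mrs(a_A, x_A), 1.0)
    ok = True
    for a, x in ((a_A, x_A), (a_B, x_B)):
        wealth = p[0] * x[0] + p[1] * x[1]   # transfer that makes x affordable
        d = demand(a, p, wealth)
        ok = ok and math.isclose(d[0], x[0]) and math.isclose(d[1], x[1])
    return p, ok

x_A, x_B = pareto_optimal_allocation(0.3, 0.6, (10.0, 8.0), 4.0)
price, supported = supports(0.3, 0.6, x_A, x_B)
print(x_A, x_B, price, supported)
```

Here the transfers are simply the wealth levels $p x_i^*$; no initial endowment distribution is needed, which is the point of Remark 8.5.1.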
Remark 8.5.3 When $\hat{w} > 0$, the strict monotonicity condition of preferences can be
weakened to local non-satiation. Also, Mas-Colell, Whinston, and Green (1995) use
the price quasi-equilibrium concept to present a version of the Second Fundamental
Theorem of Welfare Economics under much weaker conditions, which only requires the
convexity of preferences and production possibilities and local non-satiation of
preferences. Thus, when $\hat{w} > 0$, we can obtain the Second Theorem of Welfare Economics
under the local non-satiation, continuity, and convexity of preferences. For the
discussion, see Mas-Colell, Whinston, and Green (1995).

For exchange economies, the competitive equilibrium with transfers is the same as a
regular competitive equilibrium when $w_i = x_i^*$. As a corollary, we have

Corollary 8.5.1 Suppose $x^* > 0$ is Pareto optimal, suppose $\succsim_i$ are continuous, convex
and strictly monotonic. Then, $x^*$ is a competitive equilibrium allocation for the initial
endowment $w_i = x_i^*$.

Remark 8.5.4 If $\succsim_i$ can be represented by a concave and differentiable utility function,
then the proof of the Second Fundamental Theorem of Welfare Economics can be much
simpler. A sufficient condition for concavity is that the Hessian matrix is negative definite.
Also note that a monotonic transformation does not change preferences, so that we may be
able to transform a quasi-concave utility function to a concave utility function as follows,
for example:

$u(x, y) = xy$, which is quasi-concave,

$\Leftrightarrow u^{1/2}(x, y) = x^{1/2} y^{1/2}$,

which is concave after monotonic transformation.

Differentiation Version of the Second Fundamental Theorem of Welfare
Economics for Exchange Economies

Proof: If $x^* > 0$ is Pareto optimal, then we have

$Du_i(x_i^*) = q / t_i, \quad i = 1, \ldots, n.$ (8.27)

We want to show $q$ is a competitive equilibrium price vector. To do so, we only need to
show that each consumer maximizes his utility s.t. $B(q) = \{x_i \in X_i : q x_i \leq q x_i^*\}$. Indeed,
by concavity of $u_i$,

$u_i(x_i) \leq u_i(x_i^*) + Du_i(x_i^*)(x_i - x_i^*)$
$= u_i(x_i^*) + q(x_i - x_i^*)/t_i$
$\leq u_i(x_i^*).$
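The first inequality holds for a concave function because, from Figure 8.8, the slope of
the chord does not exceed the slope of the tangent at $x^*$:

$\frac{u(x) - u(x^*)}{x - x^*} \leq u'(x^*)$ for $x > x^*$,

with the reverse inequality for $x < x^*$; multiplying through by $x - x^*$ gives the same
bound in both cases.

Figure 8.8: Concave function

Thus, we have $u_i(x_i) \leq u_i(x_i^*) + Du_i(x_i^*)(x_i - x_i^*)$.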
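
The role of concavity in this argument can be checked numerically. The sketch below is an illustration under assumed test points (the grid and the reference point $(1, 1)$ are arbitrary choices, not from the text): the gradient inequality holds for the concave function $\sqrt{xy}$ but fails for $xy$, which is only quasi-concave.

```python
import math

def gradient_inequality_holds(f, grad, z_star, points, tol=1e-12):
    """True if f(z) <= f(z*) + grad(z*).(z - z*) at every test point z."""
    f_star = f(z_star)
    g = grad(z_star)
    for z in points:
        linear = f_star + sum(gi * (zi - si) for gi, zi, si in zip(g, z, z_star))
        if f(z) > linear + tol:
            return False
    return True

def u(z):
    """Quasi-concave but not concave on the positive orthant."""
    return z[0] * z[1]

def grad_u(z):
    return (z[1], z[0])

def v(z):
    """Monotonic transformation of u that is concave."""
    return math.sqrt(z[0] * z[1])

def grad_v(z):
    return (0.5 * math.sqrt(z[1] / z[0]), 0.5 * math.sqrt(z[0] / z[1]))

# Test points on a coarse positive grid (an arbitrary illustrative choice).
grid = [(0.5 * i, 0.5 * j) for i in range(1, 9) for j in range(1, 9)]
z_star = (1.0, 1.0)

print(gradient_inequality_holds(v, grad_v, z_star, grid))  # concave case
print(gradient_inequality_holds(u, grad_u, z_star, grid))  # quasi-concave case
```

A finite grid can only refute the inequality, not prove it; for $\sqrt{xy}$ at $(1, 1)$ the bound is the arithmetic-geometric mean inequality, so it holds everywhere on the positive orthant.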

8.6 Non-Convex Production Technologies and Marginal Cost Pricing




Figure 8.9: Figure (a) shows failure of the second welfare theorem with a non-convex 
technology. Figure (b) shows the first welfare theorem applies even with a non-convex 
technology. 

The indispensability of convexity for the second welfare theorem can be observed in
Figure 8.9(a). There, the allocation $x^*$ maximizes the welfare of the consumer, but for
the only value of relative prices that could support $x^*$ as a utility-maximizing bundle,
the firm does not maximize profits even locally (i.e., at the relative prices $w/p$, there are
productions arbitrarily close to $x^*$ yielding higher profits). In contrast, the first welfare
theorem remains applicable even in the presence of non-convexities. As Figure 8.9(b)
suggests, any Walrasian equilibrium maximizes the well-being of the consumer in the
feasible production set.

The second welfare theorem runs into difficulties in the presence of non-convex pro- 
duction sets (here we do not question the assumption of convexity on the consumption 
side). In the first place, large non-convexities caused by the presence of fixed costs or 
extensive increasing returns lead to a world of a small number of large firms, making the 
assumption of price taking less plausible. Yet, even if price taking can somehow be relied 
on, it may still be impossible to support a given Pareto optimal allocation. Examples are 
provided by Figures 8.9(a) and 8.10. In Figure 8.10, at the only relative prices that could 
support the production $y^*$ locally, the firm sustains a loss and would rather avoid it by
shutting down. In Figure 8.9(a), on the other hand, not even local profit maximization 
can be guaranteed. 



Figure 8.10: The firm incurs a loss at the prices that locally support the Pareto optimal
allocation.

Although non-convexities may prevent us from supporting the Pareto optimal pro- 
duction allocation as a profit-maximizing choice, under the differentiability assumptions 
on pricing rules of firms, we can use the first-order necessary conditions derived there to 
formulate a weaker result that parallels the second welfare theorem. 

Theorem 8.6.1 (The Second Welfare Theorem for Non-Convex Production Sets)
Suppose that each firm $j$'s transformation function $T_j(y)$ is smooth, and all consumers have
continuous, monotonic, and convex preferences. If $(x^*, y^*)$ is Pareto optimal, then there
exists a price vector $p$ and wealth levels $(I_1, \ldots, I_n)$ with $\sum_i I_i = p \cdot \sum_i w_i + \sum_j p \cdot y_j^*$ such
that:

(i) For any firm $j$, we have

$p = \gamma_j \nabla T_j(y_j^*)$ for some $\gamma_j > 0$.

(ii) For any $i$, $x_i^*$ is maximal for $\succsim_i$ in the budget set

$\{x_i \in X_i : p \cdot x_i \leq I_i\}.$

(iii) $\sum_i x_i^* = \sum_i w_i + \sum_j y_j^*$.

The type of equilibrium represented by conditions (i) to (iii) of the above theorem is
called a marginal cost price equilibrium with transfers. The motivation for this terminology 
comes from the one-output, one-input case. 

Condition (i) also implies that

$\frac{p^l}{p^k} = \frac{\partial T_1(y)/\partial y^l}{\partial T_1(y)/\partial y^k} = \cdots = \frac{\partial T_J(y)/\partial y^l}{\partial T_J(y)/\partial y^k},$ (8.28)

and thus the marginal rates of technical substitution, MRTS, of two goods are equal for all
firms. As we have noted, condition (i) is necessary, but not sufficient, for profit
maximization; in fact, it does not imply that $(y_1^*, \ldots, y_J^*)$ are profit-maximizing
production plans for price-taking firms. The condition says only that small changes in
production plans have no first-order effect on profit. But small changes may still have
positive second-order effects (as in Figure 8.9(a), where at a marginal cost price equilibrium
the firm actually chooses the production that minimizes profits among the efficient
productions) and, at any rate, large changes may increase profits (as in Figure 8.10). Thus,
to achieve allocation $(x^*, y^*)$ may require that a regulatory agency prevent the managers
of non-convex firms from attempting to maximize profits at the given prices $p$. Recently,
Tian (2010) presented such an incentive compatible mechanism that implements marginal
cost price equilibrium allocations with transfers in Nash equilibrium. Tian (2009) also
considers implementation of other equilibrium allocations when firms' pricing rules are
given by, for example, average-cost pricing or loss-free pricing.
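
The gap between the first-order condition and profit maximization can be seen in a one-input, one-output sketch. The technology $q = z^2$ (increasing returns, hence a non-convex production set) and the prices below are hypothetical choices made only for illustration. With cost function $c(q) = w\sqrt{q}$, price equal to marginal cost gives $z = w/(2p)$, and at that input level profit is at a local minimum rather than a maximum.

```python
def profit(z, p=1.0, w=2.0):
    """Profit from using z units of input when output is q = z**2."""
    return p * z**2 - w * z

def first_order_input(p=1.0, w=2.0):
    """Input level at which price equals marginal cost (d profit / dz = 0)."""
    return w / (2.0 * p)

z_mc = first_order_input()
print(z_mc, profit(z_mc))                          # marginal cost pricing plan
print(profit(z_mc - 0.1), profit(z_mc + 0.1))      # nearby plans earn more
print(profit(0.0))                                 # shutting down avoids the loss
```

Small deviations in either direction raise profit (a positive second-order effect), and the firm makes a loss at the marginal cost pricing plan, so a price-taking profit maximizer would not choose it.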

It should be noted that the converse result to the above theorem, which would assert
that every marginal cost price equilibrium is Pareto optimal, is not true. In Figure 8.11,
for example, we show a one-consumer economy with a non-convex production set. In the
figure, $x^*$ is a marginal cost price equilibrium with transfers for the price system $p = (1, 1)$.
Yet, allocation $x'$ yields the consumer a higher utility. Informally, this occurs because
marginal cost pricing neglects second-order conditions, and it may therefore happen that,
as at allocation $x^*$, the second-order conditions for the social utility maximization problem
are not satisfied. A marginal cost price equilibrium need not be a Pareto optimal
allocation. As a result, satisfaction of the first-order marginal optimality conditions (which
in the case of Figure 8.11 amounts simply to the tangency of the indifference curve and the
production surface) does not ensure that the allocation is Pareto optimal.

Figure 8.11: The firm incurs a loss at the prices that locally support the Pareto optimal
allocation.

See Quinzii (1992) for extensive background and discussion on the material presented 
in this section. 

8.7 Pareto Optimality and Social Welfare Maximization

Pareto efficiency is only concerned with efficiency of allocations and has nothing to say 
about distribution of welfare. Even if we agree with Pareto optimality, we still do not 
know which one we should be at. One way to solve the problem is to assume the existence 
of a social welfare function. 

Define a social welfare function $W : X \to \mathbb{R}$ by $W(u_1(x_1), \ldots, u_n(x_n))$, where we
assume that $W(\cdot)$ is monotone increasing.

Example 8.7.1 (Utilitarian Social Welfare Function) The utilitarian social welfare
function is given by

$W(u_1, \ldots, u_n) = \sum_{i=1}^n a_i u_i(x_i)$ (8.29)

with $\sum_i a_i = 1$, $a_i \geq 0$. Under a utilitarian rule, social states are ranked according to the
linear sum of utilities. The utilitarian form is by far the most common and widely applied
social welfare function in economics.
Example 8.7.2 (Rawlsian Social Welfare Function) The Rawlsian social welfare
function is defined by

$W(\cdot) = \min\{u_1(x_1), u_2(x_2), \ldots, u_n(x_n)\}.$

So this social welfare function is not strictly monotonically increasing. The Rawlsian form
gives priority to the interests of the worst-off members, and it is used in the ethical system
proposed by Rawls (1971).
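
The two examples can be written as short functions. The sketch below uses made-up utility profiles purely for illustration; it shows that raising the utility of an agent who is not the worst off increases utilitarian welfare (with positive weights) but leaves Rawlsian welfare unchanged, which is why the Rawlsian form is not strictly monotonic.

```python
def utilitarian(utilities, weights):
    """Weighted sum of utilities, W = sum_i a_i * u_i."""
    return sum(a * u for a, u in zip(weights, utilities))

def rawlsian(utilities):
    """Utility of the worst-off agent, W = min_i u_i."""
    return min(utilities)

weights = [1 / 3, 1 / 3, 1 / 3]
before = [1.0, 2.0, 3.0]
after = [1.0, 2.0, 4.0]   # only the best-off agent gains

print(utilitarian(before, weights), utilitarian(after, weights))
print(rawlsian(before), rawlsian(after))
```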

8.7.1 Social Welfare Maximization for Exchange Economies 

We suppose that a society should operate at a point that maximizes social welfare; that
is, we should choose an allocation $x^*$ such that $x^*$ solves

$\max W(u_1(x_1), \ldots, u_n(x_n))$

subject to

$\sum_{i=1}^n x_i \leq \sum_{i=1}^n w_i.$

How do the allocations that maximize this welfare function compare to Pareto efficient
allocations? The following is a trivial consequence if the strict monotonicity assumption
is imposed.

Proposition 8.7.1 Under strict monotonicity of the social welfare function $W(\cdot)$, if $x^*$
maximizes a social welfare function, then $x^*$ must be Pareto optimal.

Proof: If $x^*$ is not Pareto optimal, then there is another feasible allocation $x'$ such that
$u_i(x_i') \geq u_i(x_i^*)$ for all $i$ and $u_k(x_k') > u_k(x_k^*)$ for some $k$. Then, by strict monotonicity
of $W(\cdot)$, we have $W(u_1(x_1'), \ldots, u_n(x_n')) > W(u_1(x_1^*), \ldots, u_n(x_n^*))$ and thus $x^*$ does not
maximize the social welfare function. ■

Thus, every social welfare maximum is Pareto efficient. Is the converse necessarily
true? By the Second Fundamental Theorem of Welfare Economics, we know that every
Pareto efficient allocation is a competitive equilibrium allocation after redistributing
endowments. This gives us a further implication of competitive prices, which are the
multipliers for the welfare maximization. Thus, the competitive prices really measure the
(marginal) social value of a good. Now we state the following proposition, which shows that
every Pareto efficient allocation is a social welfare maximum for the social welfare function
with a suitable weighted sum of utilities.
Proposition 8.7.2 Let $x^* > 0$ be a Pareto optimal allocation. Suppose $u_i$ is concave,
differentiable and strictly monotonic. Then, there exists some choice of weights $a_i^*$ such
that $x^*$ maximizes the welfare function

$W(u_1, \ldots, u_n) = \sum_{i=1}^n a_i u_i(x_i).$ (8.30)

Furthermore, $a_i^* = \frac{1}{\lambda_i}$ with $\lambda_i = \frac{\partial V_i(p, I_i)}{\partial I_i}$,

where $V_i(\cdot)$ is the indirect utility function of consumer $i$.

Proof: Since $x^*$ is Pareto optimal, it is a competitive equilibrium allocation with
$w_i = x_i^*$ by the second theorem of welfare economics. So we have

$Du_i(x_i^*) = \lambda_i p$ (8.31)

by the first order condition, where $p$ is a competitive equilibrium price vector.
Now for the welfare maximization problem

$\max \sum_{i=1}^n a_i u_i(x_i)$

$\text{s.t.} \quad \sum_{i=1}^n x_i \leq \sum_{i=1}^n x_i^*,$

since $u_i$ is concave, $x^*$ solves the problem if the first order condition

$a_i \frac{\partial u_i(x_i)}{\partial x_i} = q, \quad i = 1, \ldots, n$ (8.32)

is satisfied for some $q$. Thus, if we let $p = q$, then $a_i^* = \frac{1}{\lambda_i}$. We know $x^*$ also maximizes
the welfare function $\sum_{i=1}^n a_i^* u_i(x_i')$. ■

Thus, the price is the Lagrangian multiplier of the welfare maximization, and this
measures the marginal social value of goods.
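
The proposition can be checked on a small example. The sketch below assumes log Cobb-Douglas utilities $u_i(x) = \alpha_i \ln x^1 + (1 - \alpha_i) \ln x^2$ (concave, so the first order conditions are sufficient) and hypothetical endowments. For these utilities the indirect utility is $\ln I_i$ plus a term that depends only on prices, so $\lambda_i = 1/I_i$ and the weights are $a_i^* = I_i$. The code computes the competitive equilibrium, then the closed-form maximizer of the weighted sum of utilities, and compares the two allocations.

```python
import math

def competitive_equilibrium(alphas, endowments):
    """Price of good 1 (good 2 is the numeraire), incomes and allocation."""
    p1 = (sum(a * w[1] for a, w in zip(alphas, endowments))
          / sum((1.0 - a) * w[0] for a, w in zip(alphas, endowments)))
    incomes = [p1 * w[0] + w[1] for w in endowments]
    allocation = [(a * m / p1, (1.0 - a) * m) for a, m in zip(alphas, incomes)]
    return p1, incomes, allocation

def welfare_maximizer(alphas, weights, total):
    """Closed-form solution of max sum_i weights_i*u_i(x_i) s.t. sum_i x_i = total,
    obtained from the first order conditions weights_i * Du_i(x_i) = q."""
    s1 = sum(b * a for b, a in zip(weights, alphas))
    s2 = sum(b * (1.0 - a) for b, a in zip(weights, alphas))
    return [(b * a * total[0] / s1, b * (1.0 - a) * total[1] / s2)
            for b, a in zip(weights, alphas)]

def welfare(alphas, weights, allocation):
    """Weighted sum of log Cobb-Douglas utilities."""
    return sum(b * (a * math.log(x[0]) + (1.0 - a) * math.log(x[1]))
               for b, a, x in zip(weights, alphas, allocation))

alphas = [0.3, 0.6]
endowments = [(6.0, 2.0), (4.0, 6.0)]
total = (10.0, 8.0)

p1, incomes, x_ce = competitive_equilibrium(alphas, endowments)
weights = incomes                      # a_i = 1 / lambda_i = I_i
x_sw = welfare_maximizer(alphas, weights, total)
print(x_ce)
print(x_sw)
```

The two printed allocations agree up to rounding, so the equilibrium allocation is the welfare maximum for these particular weights.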

8.7.2 Welfare Maximization in Production Economy 

Define a choice set by the transformation function

$T(\hat{x}) = 0$ with $\hat{x} = \sum_{i=1}^n x_i.$ (8.33)

Figure 8.12: Welfare maximization

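The social welfare maximization problem for the production economy is

$\max W(u_1(x_1), u_2(x_2), \ldots, u_n(x_n))$ (8.34)

subject to

$T(\hat{x}) = 0.$

Define the Lagrangian function

$L = W(u_1(x_1), \ldots, u_n(x_n)) - \lambda T(\hat{x}).$ (8.35)

The first order condition is then given by

$\frac{\partial W}{\partial u_i} \frac{\partial u_i(x_i)}{\partial x_i^l} - \lambda \frac{\partial T(\hat{x})}{\partial x^l} \leq 0$ with equality if $x_i^l > 0$, (8.36)

and thus when $x$ is an interior point, we have

$\frac{\partial u_i(x_i)/\partial x_i^l}{\partial u_i(x_i)/\partial x_i^k} = \frac{\partial T(\hat{x})/\partial x^l}{\partial T(\hat{x})/\partial x^k}.$ (8.37)

That is,

$MRS_{x_i^l, x_i^k} = MRTS_{x^l, x^k}.$ (8.38)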

The conditions characterizing welfare maximization require that the marginal rate of 
substitution between each pair of commodities must be equal to the marginal rate of 
transformation between the two commodities for all agents. 
8.8 Political Overtones



1. One implication of the First Fundamental Theorem of Welfare Economics is that what the
government should do is to secure the competitive environment in an economy and give
people full economic freedom. So, as long as the market system works well, there should
be no subsidies, no price floors, no price ceilings, no rent control, no regulations, and no
taxes or import-export barriers.

2. Even if we want to reach a preferred Pareto optimal outcome that may be different
from a competitive equilibrium from a given endowment, we might do so by adjusting the
initial endowment without disturbing prices or imposing taxes or regulations. That is, if
the resulting Pareto optimal allocation is not "fair", all the government has to do is to make
lump-sum transfer payments to the poor first, keeping the competitive environment intact. We
can adjust initial endowments to obtain a desired competitive equilibrium by the Second
Fundamental Theorem of Welfare Economics.

3. Of course, these conclusions hold only under certain conditions. In many cases, we
have market failures, in the sense that either a competitive equilibrium does not exist or
a competitive equilibrium may not be Pareto optimal, so that the First or Second
Fundamental Theorem of Welfare Economics cannot be applied.

The conditions for the existence of a competitive equilibrium are: (i) convexity
(diversification of consumption and no increasing returns to scale), (ii) monotonicity
(self-interest), (iii) continuity, (iv) divisibility, (v) perfect competition, (vi) complete
information, etc. If these conditions are not satisfied, we may not obtain the existence
of a competitive equilibrium.

The conditions for the First Fundamental Theorem of Welfare Economics are: (i) local
non-satiation (unlimited desirability), (ii) divisibility, (iii) no externalities, (iv) perfect
competition, (v) complete information, etc. If these conditions are not satisfied, we may
not guarantee that every competitive equilibrium allocation is Pareto efficient.

The conditions for the Second Fundamental Theorem of Welfare Economics are: (i) the
convexity of preferences and production sets, (ii) monotonicity (self-interest),
(iii) continuity, (iv) divisibility, (v) perfect competition, (vi) complete information, etc.
If these conditions are not satisfied, we may not guarantee that every Pareto efficient
allocation can be supported by a competitive equilibrium with transfers.

Thus, as a general rule, before making an economic statement, one should pay attention
to the assumptions that are implicitly and/or explicitly involved. As for a conclusion
from general equilibrium theory, one should notice conditions such as divisibility, no
externalities, no increasing returns to scale, perfect competition, and complete information.
If these assumptions are relaxed, a competitive equilibrium may not exist or may not be
Pareto efficient, or a Pareto efficient allocation may not be supported by a competitive
equilibrium with transfer payments. Only in such cases of market failure may we adopt
another economic institution. We will discuss market failure and how to solve the
market failure problem in Part IV and Part V.
Chapter 9

Economic Core, Fair Allocations, 
and Social Choice Theory 

9.1 Introduction 

In this chapter we briefly discuss some topics in the framework of general equilibrium
theory, namely economic core, fair allocations, and social choice theory. The theory
of the core is important because it gives an insight into how a competitive equilibrium is
achieved as a result of individual strategic behavior instead of as the result of an auctioneer
and the Walrasian tatonnement mechanism. It shows the necessity of adopting a market
institution as long as individuals behave self-interestedly.

We have also seen that Pareto optimality may be too weak a criterion to be meaningful.
It does not address any question about income distribution and equity of allocations.
Fairness is a notion to overcome this difficulty. This is one way to restrict the set of Pareto
optima.

In a slightly different framework, suppose that a society is deciding the social priority 
among finite alternatives. Alternatives may be different from Pareto optimal allocations. 
Let us think of a social "rule" to construct the social ordering (social welfare function) 
from many individual orderings of different alternatives. The question is: Is it possible 
to construct a rule satisfying several desirable properties? Both "fairness" and "social 
welfare function" address a question of social justice. 
9.2 The Core of Exchange Economies

The use of a competitive (market) system is just one way to allocate resources. What 
if we use some other social institution? Would we still end up with an allocation that 
was "close" to a competitive equilibrium allocation? The answer will be that, if we allow 
agents to form coalitions, the resulting allocation can only be a competitive equilibrium 
allocation when the economy becomes large. Such an allocation is called a core allocation 
and was originally considered by Edgeworth (1881). 

The core is a concept in which every individual and every group agree to accept an 
allocation instead of moving away from the social coalition. 

There is some reason to think that the core is a meaningful political concept. If a
group of people find themselves able, using their own resources, to achieve a better life, it
is not unreasonable to suppose that they will try to enforce this threat against the rest of
the community. They may find themselves frustrated if the rest of the community resorts to
violence or force to prevent them from withdrawing.

The theory of the core is distinguished by its parsimony. Its conceptual apparatus 
does not appeal to any specific trading mechanism nor does it assume any particular 
institutional setup. Informally, the notion of competition that the theory explores is 
one in which traders are well informed of the economic characteristics of other traders, 
and in which the members of any group of traders can bind themselves to any mutually 
advantageous agreement. 

For simplicity, we consider exchange economies. We say two agents are of the same 
type if they have the same preferences and endowments. 

The r-replication of the original economy: There are r times as many agents of each 
type in the original economy. 

A coalition is a group of agents, and thus it is a subset of the $n$ agents.

Definition 9.2.1 (Blocking Coalition) A group of agents $S$ (a coalition) is said to
block (improve upon) a given allocation $x$ if there is a Pareto improvement with their
own resources, that is, if there is some allocation $x'$ such that

(1) it is feasible for $S$, i.e., $\sum_{i \in S} x_i' \leq \sum_{i \in S} w_i$,

(2) $x_i' \succsim_i x_i$ for all $i \in S$ and $x_k' \succ_k x_k$ for some $k \in S$.
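
The two conditions of the definition translate directly into a check. The sketch below is illustrative only: preferences are assumed to be represented by utility functions, feasibility is read as the coalition not using more than its own endowment, and the two-agent economy with $u_i(x) = x^1 x^2$ is a hypothetical example.

```python
def blocks(coalition, proposal, allocation, endowments, utilities, tol=1e-12):
    """True if `coalition` (list of agent indices) can improve upon `allocation`
    by giving its members the bundles in `proposal` (dict: agent -> bundle)."""
    n_goods = len(endowments[coalition[0]])
    # Condition (1): the proposal uses only the coalition's own resources.
    for l in range(n_goods):
        used = sum(proposal[i][l] for i in coalition)
        owned = sum(endowments[i][l] for i in coalition)
        if used > owned + tol:
            return False
    # Condition (2): nobody in the coalition is worse off, somebody is better off.
    gains = [utilities[i](proposal[i]) - utilities[i](allocation[i])
             for i in coalition]
    return all(g >= -tol for g in gains) and any(g > tol for g in gains)

def cobb_douglas(x):
    return x[0] * x[1]

utilities = [cobb_douglas, cobb_douglas]
endowments = [(3.0, 1.0), (1.0, 3.0)]

# The no-trade allocation is blocked by the coalition of both agents.
print(blocks([0, 1], {0: (2.0, 2.0), 1: (2.0, 2.0)}, endowments, endowments, utilities))
# A single agent cannot improve upon no trade with his own endowment.
print(blocks([0], {0: (3.0, 1.0)}, endowments, endowments, utilities))
```

An allocation has the core property when no coalition and no proposal pass this check; the function tests one candidate proposal at a time and does not search over all of them.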
Definition 9.2.2 (Core) A feasible allocation $x$ is said to have the core property if it
cannot be improved upon by any coalition. The core of an economy is the set of all
allocations that have the core property.

Remark 9.2.1 Every allocation in the core is Pareto optimal (take the coalition of all agents).

Definition 9.2.3 (Individual Rationality) An allocation $x$ is individually rational if
$x_i \succsim_i w_i$ for all $i = 1, 2, \ldots, n$.

The individual rationality condition is also called the participation condition, which means
that a person will not participate in the economic activity if he is worse off than at the
initial endowment.

Remark 9.2.2 Every allocation in the core must be individually rational. 

Remark 9.2.3 When n = 2 and preference relations are weakly monotonic, an allocation 
is in the core if and only if it is Pareto optimal and individually rational. 










Figure 9.1: The set of allocations in the core is simply given by the set of Pareto efficient 
and individually rational allocations when n = 2. 



Remark 9.2.4 Even though a Pareto optimal allocation is independent of individual 
endowments, an allocation in the core depends on individual endowments. 
What is the relationship between core allocations and competitive equilibrium alloca-
tions? 



Theorem 9.2.1 Under local non-satiation, if $(x, p)$ is a competitive equilibrium, then $x$
has the core property.

Proof: Suppose $x$ is not an allocation in the core. Then there is a coalition $S$ and a
feasible allocation $x'$ such that

$\sum_{i \in S} x_i' \leq \sum_{i \in S} w_i$ (9.1)

and $x_i' \succsim_i x_i$ for all $i \in S$, $x_k' \succ_k x_k$ for some $k \in S$. Then, by local non-satiation, we have

$p x_i' \geq p x_i$ for all $i \in S$ and

$p x_k' > p x_k$ for some $k$.

Therefore, we have

$\sum_{i \in S} p x_i' > \sum_{i \in S} p x_i = \sum_{i \in S} p w_i,$ (9.2)

which contradicts (9.1). Therefore, the competitive equilibrium must be an allocation in the core.

We refer to the allocations in which consumers of the same type get the same
consumption bundles as equal-treatment allocations. It can be shown that any allocation in
the core must be an equal-treatment allocation.

Proposition 9.2.1 (Equal Treatment in the Core) Suppose agents' preferences are
strictly convex. If $x$ is an allocation in the r-core of a given economy, then any two
agents of the same type must receive the same bundle.

Proof: Let $x$ be an allocation in the core and index the $2r$ agents using subscripts
$A1, \ldots, Ar$ and $B1, \ldots, Br$. If all agents of the same type do not get the same allocation,
there will be one agent of each type who is most poorly treated. We will call these two
agents the "type-A underdog" and the "type-B underdog." If there are ties, select any of
the tied agents.

Let $\bar{x}_A = \frac{1}{r} \sum_{j=1}^r x_{Aj}$ and $\bar{x}_B = \frac{1}{r} \sum_{j=1}^r x_{Bj}$ be the average bundles of the type-A and
type-B agents. Since the allocation $x$ is feasible, we have

$\frac{1}{r} \sum_{j=1}^r x_{Aj} + \frac{1}{r} \sum_{j=1}^r x_{Bj} = \frac{1}{r} \sum_{j=1}^r w_{Aj} + \frac{1}{r} \sum_{j=1}^r w_{Bj} = w_A + w_B.$

It follows that

$\bar{x}_A + \bar{x}_B = w_A + w_B,$

so that $(\bar{x}_A, \bar{x}_B)$ is feasible for the coalition consisting of the two underdogs. We are
assuming that at least for one type, say type A, two of the type-A agents receive different
bundles. Hence, the A underdog will strictly prefer $\bar{x}_A$ to his present allocation by strict
convexity of preferences (since it is a weighted average of bundles that are at least as good
as his own), and the B underdog will think $\bar{x}_B$ is at least as good as his present bundle, thus
forming a coalition that can improve upon the allocation. The proof is completed.

Since any allocation in the core must award agents of the same type with the same 
bundle, we can examine the cores of replicated two-agent economies by use of the Edge- 
worth box diagram. Instead of a point x in the core representing how much A gets and 
how much B gets, we think of x as telling us how much each agent of type A gets and 
how much each agent of type B gets. The above proposition tells us that all points in the 
r-core can be represented in this manner. 

The following theorem is a converse of Theorem 9.2.1 and shows that any allocation 
that is not a market equilibrium allocation must eventually not be in the r-core of the 
economy. This means that core allocations in large economies look just like Walrasian 
equilibria. 

Theorem 9.2.2 (Shrinking Core Theorem) Suppose $\succsim_i$ are strictly convex and
continuous. Suppose $x^*$ is a unique competitive equilibrium allocation. Then, if $y$ is not a
competitive equilibrium, there is some replication $V$ such that $y$ is not in the $V$-core.

Proof: We prove the theorem with the aid of Figure 9.2. From the figure, one can see
that $y$ is not a competitive equilibrium. We want to show that there is a coalition such
that the point $y$ can be improved upon for the $V$-replication. Since $y$ is not a competitive
equilibrium, the line segment through $y$ and $w$ must cut at least one agent's, say agent
A's, indifference curve through $y$. Then, by strict convexity and continuity of $\succsim_i$, there
are integers $V$ and $T$ with $0 < T < V$ such that

$g_A \equiv \frac{T}{V} w_A + \left(1 - \frac{T}{V}\right) y_A \succ_A y_A.$

We can do so since any real number can be approached by a rational number that consists
of the ratio of two suitable integers $\frac{T}{V}$.




Figure 9.2: The shrinking core. A point like y will eventually not be in the core. 

Now, form a coalition consisting of $V$ consumers of type A and $V - T$ consumers of
type B and consider the allocation $x = (g_A, \ldots, g_A, y_B, \ldots, y_B)$ (in which there are $V$ Type
A and $V - T$ Type B). We want to show $x$ is feasible for this coalition.

$V g_A + (V - T) y_B = V \left[\frac{T}{V} w_A + \left(1 - \frac{T}{V}\right) y_A\right] + (V - T) y_B$
$= T w_A + (V - T) y_A + (V - T) y_B$
$= T w_A + (V - T)(y_A + y_B)$
$= T w_A + (V - T)(w_A + w_B)$
$= V w_A + (V - T) w_B$

by noting $y_A + y_B = w_A + w_B$. Thus, $x$ is feasible in the coalition and $g_A \succ_A y_A$ for
all agents in type A and $y_B \sim_B y_B$ for all agents in type B, which means $y$ is not in the
$V$-core for the $V$-replication of the economy. The proof is completed.
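
The construction in the proof can be traced on a concrete case. The sketch below assumes a hypothetical economy with two types, utility $u(x) = x^1 x^2$ for both, endowments $w_A = (3, 1)$ and $w_B = (1, 3)$, and a candidate allocation $y_A = (9/5, 9/5)$, $y_B = (11/5, 11/5)$ that is Pareto optimal and individually rational (hence in the core of the unreplicated economy by Remark 9.2.3) but is not the competitive equilibrium $(2, 2)$, $(2, 2)$. Exact rational arithmetic is used so that the feasibility identity can be tested with equality.

```python
from fractions import Fraction as F

w_A, w_B = (F(3), F(1)), (F(1), F(3))                   # endowments by type
y_A, y_B = (F(9, 5), F(9, 5)), (F(11, 5), F(11, 5))     # candidate allocation

def u(x):
    """Common utility function, representing strictly convex preferences."""
    return x[0] * x[1]

def mixed_bundle(T, V):
    """g_A = (T/V) w_A + (1 - T/V) y_A, as in the proof."""
    lam = F(T, V)
    return tuple(lam * w + (1 - lam) * y for w, y in zip(w_A, y_A))

def coalition_blocks(T, V):
    """V type-A agents get g_A, V - T type-B agents keep y_B."""
    g_A = mixed_bundle(T, V)
    used = tuple(V * g + (V - T) * yb for g, yb in zip(g_A, y_B))
    owned = tuple(V * wa + (V - T) * wb for wa, wb in zip(w_A, w_B))
    return used == owned and u(g_A) > u(y_A)

def find_blocking_replication(max_V=20):
    """First pair (T, V) with 0 < T < V for which the coalition blocks y."""
    for V in range(2, max_V + 1):
        for T in range(1, V):
            if coalition_blocks(T, V):
                return T, V
    return None

print(find_blocking_replication())
print(mixed_bundle(1, 2), u(mixed_bundle(1, 2)), u(y_A))
```

In this example two type-A agents and one type-B agent already suffice: each type-A member receives $(12/5, 7/5)$, which he strictly prefers to $y_A$, while the type-B member keeps $y_B$.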

Remark 9.2.5 The shrinking core theorem then shows that the only allocations that are
in the core of a large economy are market equilibrium allocations, and thus Walrasian
equilibria are robust: even very weak equilibrium concepts, like that of the core, tend to
yield allocations that are close to Walrasian equilibria for larger economies. Thus, this
theorem shows the essential importance of competition and full economic freedom.

Remark 9.2.6 Many of the restrictive assumptions in this proposition can be relaxed 
such as strict monotonicity, convexity, uniqueness of competitive equilibrium, and two 
types of agents. 

From the above discussion, we have the following limit theorem. 

Theorem 9.2.3 (Limit Theorem on the Core) Under the strict convexity and con- 
tinuity, the core of a replicated two person economy shrinks when the number of agents 
for each type increases, and the core coincides with the competitive equilibrium allocation 
if the number of agents goes to infinity. 

This result means that any allocation which is not a competitive equilibrium allocation 
is not in the core for some r-replication. 

9.3 Fairness of Allocation 

Pareto efficiency gives a criterion of how the goods are allocated efficiently, but it may be 
too weak a criterion to be meaningful. It does not address any questions about income 
distribution, and does not give any "equity" implication. Fairness is a notion that may 
overcome this difficulty. This is one way to restrict the whole set of Pareto efficient 
outcomes to a small set of Pareto efficient outcomes that satisfy the other properties. 

What is the equitable allocation? 

How can we define the notion of equitable allocation? 

Definition 9.3.1 (Envy) An agent $i$ is said to envy agent $k$ if agent $i$ prefers agent $k$'s
consumption, i.e., $x_k \succ_i x_i$.

Definition 9.3.2 An allocation $x$ is equitable if no one envies anyone else, i.e., for each
$i \in N$, $x_i \succsim_i x_k$ for all $k \in N$.

Definition 9.3.3 (Fairness) An allocation x is said to be fair if it is both Pareto optimal 
and equitable. 
Figure 9.3: A fair allocation.



Remark 9.3.1 By definition, the set of fair allocations is a subset of the set of Pareto
efficient allocations. Therefore, fairness restricts the set of Pareto optimal allocations.

The following strict fairness concept is due to Lin Zhou (JET, 1992, 57: 158-175).
An agent $i$ envies a coalition $S$ ($i \notin S$) at an allocation $x$ if $\bar{x}_S \succ_i x_i$, where
$\bar{x}_S = \frac{1}{|S|} \sum_{j \in S} x_j$.

Definition 9.3.4 An allocation x is strictly equitable or strictly envy-free if no one envies 
any other coalitions. 

Definition 9.3.5 (Strict Fairness) An allocation x is said to be strictly fair if it is both 
Pareto optimal and strictly equitable. 

Remark 9.3.2 The set of strictly fair allocations is a subset of the set of Pareto optimal
allocations.

Remark 9.3.3 For a two person exchange economy, if x is Pareto optimal, it is impossible 
for two persons to envy each other. 

Remark 9.3.4 It is clear every strictly fair allocation is a fair allocation, but the converse 
may not be true. However, when n = 2, a fair allocation is a strictly fair allocation. 
The following figure shows that $x$ is Pareto efficient, but not equitable.

Figure 9.4: $x$ is Pareto efficient, but not equitable.

The figure below shows that $x$ is equitable, but it is not Pareto efficient.




Figure 9.5: x is equitable, but not Pareto efficient. 

How to test a fair allocation? 

Graphical Procedures for Testing Fairness: 

Let us restrict attention to a two-person economy. An easy way for agent A to
compare his own allocation $x_A$ with agent B's allocation $x_B$ in the Edgeworth box is to
find the point symmetric to $x_A$ with respect to the center of the box. That is, draw a line from $x_A$
to the center of the box and extend it to the other side by the same length to find
$x'_A$, and then make the comparison. If the indifference curve through $x_A$ cuts "below" $x'_A$,
then A envies B. We then have the following way to test whether an allocation is a fair
allocation:

Step 1: Is the allocation Pareto optimal? If the answer is "yes", go to Step 2; if "no",
stop.

Step 2: Construct the reflection point $(x_B, x_A)$. (Note that $\frac{x_A + x_B}{2}$ is the
center of the Edgeworth box.)

Step 3: Compare $x_B$ with $x_A$ for person A to see if $x_B \succ_A x_A$, and compare
$x_A$ with $x_B$ for person B to see if $x_A \succ_B x_B$. If the answer is "no" for both
persons, it is a fair allocation.
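
The three steps can be sketched numerically. The following is only an illustration under assumptions that are not part of the notes: two goods, Cobb-Douglas utilities with hypothetical exponents `alpha_a` and `alpha_b`, an interior allocation (so that Pareto optimality reduces to equal marginal rates of substitution), and a hypothetical helper name `check_fairness`.

```python
import math

def utility(alpha, x):
    """Cobb-Douglas utility u(x1, x2) = x1**alpha * x2**(1 - alpha)."""
    return x[0] ** alpha * x[1] ** (1.0 - alpha)

def mrs(alpha, x):
    """Marginal rate of substitution at an interior bundle x."""
    return (alpha / (1.0 - alpha)) * (x[1] / x[0])

def check_fairness(x_a, total, alpha_a, alpha_b):
    """Run Steps 1-3 on the allocation (x_a, total - x_a)."""
    x_b = (total[0] - x_a[0], total[1] - x_a[1])
    # Step 1: at an interior point, Pareto optimality means equal MRS.
    pareto = math.isclose(mrs(alpha_a, x_a), mrs(alpha_b, x_b), rel_tol=1e-9)
    # Step 2: reflecting x_a through the center of the box gives x_b,
    # so the reflected allocation is simply (x_b, x_a).
    # Step 3: each agent compares his own bundle with the other's bundle.
    a_envies = utility(alpha_a, x_b) > utility(alpha_a, x_a)
    b_envies = utility(alpha_b, x_a) > utility(alpha_b, x_b)
    return {"pareto": pareto, "A_envies_B": a_envies, "B_envies_A": b_envies,
            "fair": pareto and not a_envies and not b_envies}

total = (2.0, 2.0)                                       # size of the Edgeworth box
print(check_fairness((1.5, 0.5), total, 0.75, 0.25))     # efficient and envy-free
print(check_fairness((0.5, 1 / 14), total, 0.75, 0.25))  # efficient, but A envies B
print(check_fairness((1.0, 1.0), total, 0.75, 0.25))     # envy-free, but not efficient
```

The second and third calls correspond to the situations in Figures 9.4 and 9.5: an efficient allocation that is not equitable, and an equitable allocation that is not efficient.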




Figure 9.6: How to test a fair allocation. 

We have given some desirable properties of "fair" allocations. A natural question is whether
such allocations exist at all. The following theorem provides one sufficient condition that guarantees the
existence of fair allocations.

Theorem 9.3.1 Let $(x^*, p^*)$ be a competitive equilibrium. Under local non-satiation, if
all individuals' income is the same, i.e., $p^* w_1 = p^* w_2 = \cdots = p^* w_n$, then $x^*$ is a strictly
fair allocation.

Proof: By local non-satiation, $x^*$ is Pareto optimal by the First Fundamental Theorem
of Welfare Economics. We only need to show $x^*$ is strictly equitable. Suppose not. Then there
is an agent $i$ and a coalition $S$ with $i \notin S$ such that

$$\bar{x}^*_S \equiv \frac{1}{|S|} \sum_{k \in S} x^*_k \succ_i x^*_i.$$

Then, since $x^*_i$ is optimal for agent $i$ in his budget set, we have $p^* \bar{x}^*_S > p^* x^*_i = p^* w_i$. But this contradicts the fact that

$$p^* \bar{x}^*_S = \frac{1}{|S|} \sum_{k \in S} p^* x^*_k = \frac{1}{|S|} \sum_{k \in S} p^* w_k = p^* w_i \qquad (9.4)$$

by noting that $p^* w_1 = p^* w_2 = \cdots = p^* w_n$. Therefore, it must be a strictly fair allocation.

Definition 9.3.6 An allocation $x \in \mathbb{R}^{nL}_+$ is an equal income Walrasian allocation if there
exists a price vector $p$ such that

(1) $p x_i \leq p \bar{w}$, where $\bar{w} = \frac{1}{n} \sum_{k=1}^{n} w_k$ is the average endowment.

(2) $x'_i \succ_i x_i$ implies $p x'_i > p \bar{w}$.

(3) $\sum x_i = \sum w_i$.

Notice that every equal income Walrasian allocation $x$ is a competitive equilibrium allocation
with $w_i = \bar{w}$ for all $i$.

Corollary 9.3.1 Under local non-satiation, every equal income Walrasian allocation is
a strictly fair allocation.
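
As a numerical illustration of the corollary, the sketch below computes an equal income Walrasian allocation in closed form and checks that no agent envies any coalition. It rests on assumptions not made in the notes: Cobb-Douglas preferences with strictly positive weights, income normalized to 1, and hypothetical function names.

```python
from itertools import combinations
import math

def equal_income_equilibrium(alphas, w_bar):
    """Equilibrium prices and allocation when every agent owns the bundle w_bar.

    alphas[i][l] is agent i's Cobb-Douglas weight on good l (weights sum to 1).
    Income is normalized so that p . w_bar = 1 for every agent.
    """
    n, L = len(alphas), len(w_bar)
    # Market clearing for good l: sum_i alphas[i][l] / p_l = n * w_bar[l].
    prices = [sum(a[l] for a in alphas) / (n * w_bar[l]) for l in range(L)]
    alloc = [[a[l] / prices[l] for l in range(L)] for a in alphas]
    return prices, alloc

def log_utility(alpha, x):
    """Logarithm of the Cobb-Douglas utility (same ordering of bundles)."""
    return sum(a * math.log(v) for a, v in zip(alpha, x))

def envied_coalitions(i, alphas, alloc, tol=1e-12):
    """Coalitions S (not containing i) whose average bundle i strictly prefers."""
    others = [k for k in range(len(alloc)) if k != i]
    own = log_utility(alphas[i], alloc[i])
    envied = []
    for size in range(1, len(others) + 1):
        for S in combinations(others, size):
            avg = [sum(alloc[k][l] for k in S) / size for l in range(len(alloc[i]))]
            if log_utility(alphas[i], avg) > own + tol:
                envied.append(S)
    return envied

alphas = [(0.6, 0.3, 0.1), (0.2, 0.5, 0.3), (0.1, 0.2, 0.7)]  # illustrative weights
w_bar = (1.0, 1.0, 1.0)                                       # average endowment
prices, alloc = equal_income_equilibrium(alphas, w_bar)
for i in range(len(alphas)):
    print(i, envied_coalitions(i, alphas, alloc))  # an empty list means no envy
```

The check mirrors the proof of Theorem 9.3.1: every coalition's average bundle costs exactly the common income, so it lies in each agent's budget set and cannot be strictly preferred to the agent's own optimal bundle.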

Remark 9.3.5 An "equal" division of resources does not by itself give "fairness," but trading
from an "equal" position will result in a "fair" allocation. This "divide-and-choose"
recommendation implies that if the center of the box is chosen as the initial endowment point,
the competitive equilibrium allocation is fair. The political implication of this remark is
straightforward. Consumption of equal bundles is not Pareto optimal if preferences are
different. However, an equal division of endowments plus competitive markets results in
fair allocations.

Remark 9.3.6 A competitive equilibrium from an equitable (but not "equal" division) 
endowment is not necessarily fair. 

Remark 9.3.7 Fair allocations are defined without reference to initial endowments.
Since we are dealing with optimality concepts, initial endowments can be redistributed
among agents in a society.

In general, there is no relationship between an allocation in the core and a fair
allocation. However, when the social endowments are divided equally between two persons, we
have the following theorem.

Theorem 9.3.2 In a two-person exchange economy, if preferences $\succsim_i$ are convex, and if the total
endowments are equally divided among individuals, then the set of allocations in the core
is a subset of the set of (strictly) fair allocations.

Proof: We know that an allocation in the core x is Pareto optimal. We only need to show
that x is equitable. Since x is a core allocation, x is individually rational (everyone
weakly prefers his bundle to his initial endowment). If x is not equitable, there is some agent i, say, agent A, such
that

$$x_B \succ_A x_A \succsim_A w_A = \frac{1}{2}(w_A + w_B) = \frac{1}{2}(x_A + x_B)$$

by noting that $w_A = w_B$ and x is feasible. Thus, $x_A \succsim_A \frac{1}{2}(x_A + x_B)$. But, on the other
hand, since $\succsim_A$ is convex, $x_B \succ_A x_A$ implies that $\frac{1}{2}(x_A + x_B) \succ_A x_A$, a contradiction.

9.4 Social Choice Theory 
9.4.1 Introduction 

In this section, we present a very brief summary and introduction of social choice theory.
We analyze the extent to which individual preferences can be aggregated into social
preferences, or more directly into social decisions, in a "satisfactory" manner (in a manner
compatible with the fulfilment of a variety of desirable conditions).

As was shown in the discussion of "fairness," it is difficult to come up with a criterion
(or a constitution) that determines society's choice. Social choice theory aims at
constructing such a rule, one that could be applied not only to Pareto efficient allocations
but also to any alternatives that a society faces. We will give some fundamental results of
social choice theory. Arrow's Impossibility Theorem states that there does not exist any
non-dictatorial social welfare function satisfying a number of "reasonable" assumptions.
The Gibbard-Satterthwaite theorem states that no social choice mechanism exists which is
non-dictatorial and can never be advantageously manipulated by some agents.

9.4.2 Basic Settings 

To simplify the exposition, we will return to the notation used in pre-orders of preferences.
Thus $a P_i b$ means that $i$ strictly prefers $a$ to $b$. We will also assume that the individual
preferences are all strict, in other words, that there is never a situation of indifference.
For all $i$ and all distinct $a$ and $b$, one gets either $a P_i b$ or $b P_i a$ (this hypothesis is hardly restrictive if
the set of alternatives is finite).

$N = \{1, 2, \ldots, n\}$: the set of individuals;

$X = \{x_1, x_2, \ldots, x_m\}$ ($m \geq 3$): the set of alternatives (outcomes);

$P_i$ (or $\succ_i$): strict preference ordering of agent $i$;

$\mathcal{P}_i$: the class of allowed individual orderings;

$P = (P_1, P_2, \ldots, P_n)$: a preference ordering profile;

$\mathcal{P}^n$: the set of all profiles of individual orderings;

$S(X)$: the class of allowed social orderings.

Arrow's social welfare function:

$$F : \mathcal{P}^n \to S(X) \qquad (9.5)$$

which is a mapping from individual ordering profiles to social orderings.

Gibbard-Satterthwaite's social choice function (SCF) is a mapping from individual
preference orderings to the alternatives

$$f : \mathcal{P}^n \to X \qquad (9.6)$$

Note that even though individuals' preference orderings are transitive, a social
preference ordering may not be transitive. To see this, consider the following example.

Example 9.4.1 (The Condorcet Paradox) Suppose a social choice is determined by
the majority voting rule. Does this determine a well-defined social welfare function? The
answer is in general no by the well-known Condorcet paradox. Consider a society with
three agents and three alternatives: x, y, z. Suppose each person's preference is given by

$x \succ_1 y \succ_1 z$ (by person 1)

$y \succ_2 z \succ_2 x$ (by person 2)

$z \succ_3 x \succ_3 y$ (by person 3)

By the majority rule, 

For x and y, xFy (by social preference) 
For y and z, yFz (by social preference) 
For x and z, zFx (by social preference) 

Then, pairwise majority voting tells us that x must be socially preferred to y, y must 
be socially preferred to z, and z must be socially preferred to x. This cyclic pattern means 
that social preference is not transitive. 
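
The cycle can be checked mechanically. The sketch below encodes the three rankings of Example 9.4.1, builds the pairwise majority relation, and tests it for transitivity; the function names are hypothetical and chosen only for this illustration.

```python
from itertools import permutations

def majority_prefers(profile, a, b):
    """True if strictly more voters rank a above b than b above a."""
    for_a = sum(1 for ranking in profile if ranking.index(a) < ranking.index(b))
    return for_a > len(profile) - for_a

def social_relation(profile, alternatives):
    """All ordered pairs (a, b) such that a beats b under pairwise majority."""
    return {(a, b) for a, b in permutations(alternatives, 2)
            if majority_prefers(profile, a, b)}

def is_transitive(relation):
    """Check that (a, b) and (b, d) in the relation always imply (a, d)."""
    return all((a, d) in relation
               for (a, b) in relation for (c, d) in relation if b == c)

profile = [("x", "y", "z"),   # person 1
           ("y", "z", "x"),   # person 2
           ("z", "x", "y")]   # person 3
F = social_relation(profile, "xyz")
print(sorted(F))          # pairs (winner, loser) under majority voting
print(is_transitive(F))   # the majority relation cycles, so this is False
```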

The number of preference profiles increases very rapidly with the number
of alternatives.

Example 9.4.2 $X = \{x, y, z\}$, $n = 3$

$x \succ y \succ z$
$x \succ z \succ y$
$y \succ x \succ z$
$y \succ z \succ x$
$z \succ x \succ y$
$z \succ y \succ x$

Thus, there are six possible individual orderings, i.e., $|\mathcal{P}_i| = 6$, and therefore there are
$|\mathcal{P}_1| \times |\mathcal{P}_2| \times |\mathcal{P}_3| = 6^3 = 216$ possible combinations of 3-individual preference orderings
on three alternatives. The social welfare function is a mapping from each of these 216 
entries (cases) to one particular social ordering (among six possible social orderings of 
three alternatives). The social choice function is a mapping from each of these 216 cases 
to one particular choice (among three alternatives). A question we will investigate is what 
kinds of desirable conditions should be imposed on these social welfare or choice functions. 
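
The counts in Example 9.4.2 can be reproduced by enumeration. The sketch below lists the strict orderings and the profiles explicitly and, as an added illustration not in the notes, gives the general formula $(m!)^n$ together with the number of conceivable social welfare functions; the helper names are hypothetical.

```python
from itertools import permutations, product
from math import factorial

alternatives = ("x", "y", "z")
n_agents = 3

orderings = list(permutations(alternatives))          # strict orderings of X
profiles = list(product(orderings, repeat=n_agents))  # one ordering per agent

print(len(orderings))   # 3! = 6
print(len(profiles))    # 6**3 = 216

def count_profiles(m, n):
    """Number of strict preference profiles with m alternatives and n agents."""
    return factorial(m) ** n

def count_social_welfare_functions(m, n):
    """Number of maps from profiles to strict social orderings."""
    return factorial(m) ** count_profiles(m, n)

def count_social_choice_functions(m, n):
    """Number of maps from profiles to single alternatives."""
    return m ** count_profiles(m, n)
```

Even for three agents and three alternatives there are $6^{216}$ candidate social welfare functions, which is why the analysis proceeds by imposing axioms rather than by enumeration.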

You may think of a hypothetical case in which you are sending a letter listing your
preference ordering, $P_i$, on announced national projects (alternatives), such as reducing
deficits, expanding the medical program, reducing the social security program, increasing
the national defence budget, to your Congressional representatives. The Congress convenes
with a huge stack of letters $P$ and tries to come up with national priorities $F(P)$. You want
the Congress to make a certain rule (the Constitution) to form national priorities out of
individual preference orderings. This is a question addressed in social choice theory.

9.4.3 Arrow's Impossibility Theorem 

Definition 9.4.1 Unrestricted Domain (UD): A class of allowed individual orderings $\mathcal{P}_i$
consists of all possible orderings defined on $X$.

Definition 9.4.2 Pareto Principle (P): if for $x, y \in X$, $x P_i y$ for all $i \in N$, then $x F(P) y$
(social preferences).

Definition 9.4.3 Independence of Irrelevant Alternatives (IIA): For any two alternatives
$x, y \in X$ and any two preference profiles $P, P' \in \mathcal{P}^n$, $x F(P) y$ and $\{i : x P_i y\} = \{i : x P'_i y\}$
(i.e., $x P_i y$ if and only if $x P'_i y$ for all $i$) imply that $x F(P') y$. That is, the social preference between any two alternatives
depends only on the profile of individual preferences over the same alternatives.

Remark 9.4.1 IIA means that if every agent ranks x and y in the same way under
P and P', then the social ranking between x and y is the same under F(P) and F(P').
In other words, if two different preference profiles agree on x
and y, the social orderings must also agree on x and y.

Remark 9.4.2 By IIA, any change in preference ordering other than the ordering of x 
and y should not affect social ordering between x and y. 

Example 9.4.3 Suppose $x P_i y P_i z$ and $x P'_i z P'_i y$.
By IIA, if $x F(P) y$, then $x F(P') y$.
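
To see that IIA has bite, the sketch below shows a rule that violates it. The Borda count is not discussed in the notes and is used here only as a standard illustration; the two five-voter profiles are hypothetical. Every voter ranks x against y in the same way in both profiles, yet the social ranking of x and y flips when z is moved.

```python
def borda_scores(profile):
    """Borda scores: with m alternatives, position k (0-based) earns m - 1 - k points."""
    m = len(profile[0])
    scores = {}
    for ranking in profile:
        for k, alt in enumerate(ranking):
            scores[alt] = scores.get(alt, 0) + (m - 1 - k)
    return scores

def pairwise_pattern(profile, a, b):
    """For each agent, whether a is ranked above b."""
    return [r.index(a) < r.index(b) for r in profile]

# Three voters rank x > y > z in both profiles; the other two only move z.
P = [("x", "y", "z")] * 3 + [("y", "z", "x")] * 2
P_prime = [("x", "y", "z")] * 3 + [("y", "x", "z")] * 2

print(pairwise_pattern(P, "x", "y") == pairwise_pattern(P_prime, "x", "y"))
print(borda_scores(P))        # y scores higher than x
print(borda_scores(P_prime))  # x scores higher than y
```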

Definition 9.4.4 (Dictator) A social welfare function $F$ is dictatorial if there is some agent $i \in N$ such that $F(P) = P_i$ for all
$P \in \mathcal{P}^n$; agent $i$ is then called a dictator.

Theorem 9.4.1 (Arrow's Impossibility Theorem) If $m \geq 3$, any social welfare function that
satisfies the UD, P, and IIA conditions is dictatorial.

Proof: The idea for the following proof dates back to Vickrey (1960); it comprises 
three lemmas. 

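Lemma 9.4.1 (Neutrality) Given a division $N = M \cup I$ and $(a, b, x, y) \in A^4$ such that

• $\forall i \in M$, $x P_i y$ and $a P'_i b$
• $\forall i \in I$, $y P_i x$ and $b P'_i a$

then

$$x P y \Leftrightarrow a P' b \qquad (9.7)$$

$$x I y \Leftrightarrow a I' b \qquad (9.8)$$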



Proof: The interpretation of lemma 9.4.1 is simple: If x and y are ordered by each 
individual in P as a and b in P', then this must also be true of social preference; that is, x 
and y must be ordered by P as a and b by P' . If this weren't the case, then the procedure 
of aggregation would treat the pairs (a, b) and (x, y) in a non-neutral manner, hence the 
name of the lemma. 

Let us first prove (9.7). Suppose that $x P y$ and that $a, b, x, y$ are pairwise
distinct. Let preferences $P''$ be such that

• $\forall i \in M$, $a P''_i x P''_i y P''_i b$
• $\forall i \in I$, $y P''_i b P''_i a P''_i x$

(such preferences exist because the domain is universal). By IIA, since x and y are
ordered individually in $P''$ as in $P$, we have $x P'' y$. By the Pareto principle, $a P'' x$ and
$y P'' b$. By transitivity, we get $a P'' b$. Finally, in reapplying IIA (a and b are ordered individually in $P''$ as in $P'$), we find that $a P' b$. The
cases where a or b coincide with x or y are treated similarly.

Part (9.8) is obtained directly. If $x I y$ but, for example, $a P' b$, we find the contradiction
$x P y$ by (9.7) in reversing the roles of (a, b) and (x, y).

Before stating Lemma 9.4.2, two terms must be defined. We say that a set of agents
M is almost decisive on (x, y) if for every P,

($\forall i \in M$, $x P_i y$ and $\forall i \notin M$, $y P_i x$) $\Rightarrow$ $x P y$

We say that M is decisive on (x, y) if for every P,

($\forall i \in M$, $x P_i y$) $\Rightarrow$ $x P y$

Lemma 9.4.2 If M is almost decisive on (x,y), then it is decisive on (x,y). 

Proof: Suppose that $\forall i \in M$, $x P_i y$. By IIA, only individual preferences count on (x, y);
the others can be changed. Assume therefore, with no loss of generality, that z exists
such that $x P_i z P_i y$ if $i \in M$ and z is preferred to x and y by the other agents. Neutrality
imposes that $x P z$ because the individual preferences are oriented as on (x, y) and M is
almost decisive on (x, y). Finally, the Pareto principle implies that $z P y$ and transitivity
implies the conclusion that $x P y$.

Note that neutrality implies that if M is decisive on (x, y), it is decisive on every other 
pair. We will therefore just say that M is decisive. 

Lemma 9.4.3 If M is decisive and contains at least two agents, then a strict subset of 
M exists that is decisive. 

Proof: Divide $M = M_1 \cup M_2$ into two nonempty disjoint subsets, and choose a P such that

• on $M_1$, $x P_i y P_i z$
• on $M_2$, $y P_i z P_i x$
• outside $M$, $z P_i x P_i y$

As M is decisive, we have $y P z$.
One of two things results:

• Either $y P x$, and (since z doesn't count by IIA) $M_2$ is almost decisive on (y, x), and
therefore decisive by Lemma 9.4.2.

• Or $x R y$, and by transitivity $x P z$; then (y doesn't count by IIA) $M_1$ is almost decisive
on (x, z), and therefore is decisive by Lemma 9.4.2.

The proof is concluded by noting that by the Pareto principle, the set N of all agents is decisive.
Applying Lemma 9.4.3 repeatedly, we obtain an individual i that is decisive, and this is therefore
the sought-after dictator. The attentive reader will have noted that Lemma 9.4.1 remains
valid when there are only two alternatives, contrary to Lemmas 9.4.2 and 9.4.3. The proof
of the theorem is completed.

The impact of Arrow's impossibility theorem has been quite substantial. Obviously,
Arrow's impossibility result is a disappointment. The most pessimistic reaction to it is
to conclude that there is just no acceptable way to aggregate individual preferences, and
hence no theoretical basis for treating welfare issues. A more moderate reaction, however,
is to examine each of the assumptions of the theorem to see which might be given up.
Conditions imposed on social welfare functions may be too restrictive. Indeed, when some
conditions are relaxed, the results can be positive; for example, UD is often relaxed.

9.4.4 Some Positive Results: Restricted Domain

When some of the assumptions imposed in Arrow's impossibility theorem are removed, the
result may be positive. For instance, if alternatives have certain characteristics which
could be placed in a spectrum, preferences may show some patterns and may not exhaust
all possible orderings on X, thus violating (UD). In the following, we consider the case
of restricted domain. A famous example is a class of "single-peaked" preferences. We
show that under the assumption of single-peaked preferences, non-dictatorial aggregation
is possible.

Definition 9.4.5 A binary relation $\geq$ on X is a linear order if $\geq$ is reflexive ($x \geq x$),
transitive ($x \geq y \geq z$ implies $x \geq z$), and total (for distinct $x, y \in X$, either $x \geq y$ or
$y \geq x$, but not both).

Example: $X = \mathbb{R}$ and $x \geq y$.

Definition 9.4.6 $\succsim_i$ is said to be single-peaked with respect to the linear order $\geq$ on X,
if there is an alternative $x \in X$ such that $\succsim_i$ is increasing with respect to $\geq$ on the lower
contour set $L(x) = \{y \in X : y \leq x\}$ and decreasing with respect to $\geq$ on the upper
contour set $U(x) = \{y \in X : y \geq x\}$. That is,

(1) $x \geq z > y$ implies $z \succ_i y$

(2) $y > z \geq x$ implies $z \succ_i y$

In words, there is an alternative that represents a peak of satisfaction and, moreover, 
satisfaction increases as we approach this peak so that, in particular, there cannot be any 
other peak of satisfaction. 




Figure 9.7: u in the left figure is single-peaked, u in the right figure is not single-peaked. 

Figure 9.8: Five Agents who have single-peaked preferences.

Given a profile of preferences $(\succsim_1, \ldots, \succsim_n)$, let $x_i$ be the maximal alternative for $\succsim_i$
(we will say that $x_i$ is "individual i's peak").

Definition 9.4.7 Agent $h \in N$ is a median agent for the profile $(\succsim_1, \ldots, \succsim_n)$ if $\#\{i \in
N : x_i \geq x_h\} \geq \frac{n}{2}$ and $\#\{i \in N : x_h \geq x_i\} \geq \frac{n}{2}$.

Proposition 9.4.1 Suppose that $\geq$ is a linear order on X and that every $\succsim_i$ is single-peaked. Let $h \in N$
be a median agent. Then the majority rule $F(\succsim)$ is aggregatable:

$$x_h F(\succsim) y \quad \forall y \in X$$

That is, the peak $x_h$ of the median agent is socially optimal (cannot be defeated by any
other alternative) by majority voting. Any alternative having this property is called a
Condorcet winner. Therefore, a Condorcet winner exists whenever the preferences of all
agents are single-peaked with respect to the same linear order.

Proof. Take any $y \in X$ and suppose that $x_h > y$ (the argument is the same for
$y > x_h$). We need to show that

$$\#\{i \in N : x_h \succ_i y\} \geq \#\{i \in N : y \succ_i x_h\}.$$

Consider the set of agents $S \subset N$ that have peaks larger than or equal to $x_h$, that is,
$S = \{i \in N : x_i \geq x_h\}$. Then $x_i \geq x_h > y$ for every $i \in S$. Hence, by single-peakedness of
$\succsim_i$ with respect to $\geq$, we get $x_h \succ_i y$ for every $i \in S$ and thus $\#S \leq \#\{i \in N : x_h \succ_i y\}$.
Hence, $\{i \in N : y \succ_i x_h\} \subset (N \setminus S)$, and then $\#\{i \in N : y \succ_i x_h\} \leq \#(N \setminus S)$. On the
other hand, because agent h is a median agent, we have that $\#S \geq n/2$ and therefore
$\#\{i \in N : y \succ_i x_h\} \leq \#(N \setminus S) \leq n/2 \leq \#S \leq \#\{i \in N : x_h \succ_i y\}$.
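
The proposition can be illustrated on a small example. The sketch below assumes, beyond what the notes specify, that alternatives are the integers 0 to 10 and that each agent's single-peaked preference is represented by the distance to his peak; the peaks and function names are hypothetical.

```python
def median_peaks(peaks):
    """Peaks x_h satisfying Definition 9.4.7 (at least n/2 peaks on each side)."""
    n = len(peaks)
    return sorted({h for h in peaks
                   if sum(1 for x in peaks if x >= h) >= n / 2
                   and sum(1 for x in peaks if h >= x) >= n / 2})

def is_condorcet_winner(x, alternatives, peaks):
    """True if no alternative y is strictly preferred to x by more agents than prefer x to y."""
    def prefers(peak, a, b):
        # Single-peaked preference represented by closeness to the peak.
        return abs(a - peak) < abs(b - peak)
    for y in alternatives:
        if y == x:
            continue
        for_x = sum(1 for p in peaks if prefers(p, x, y))
        for_y = sum(1 for p in peaks if prefers(p, y, x))
        if for_y > for_x:
            return False
    return True

peaks = [1, 3, 4, 7, 9]          # five agents, one peak each
alternatives = range(0, 11)
print(median_peaks(peaks))
print([x for x in alternatives if is_condorcet_winner(x, alternatives, peaks)])
```

With an odd number of agents and distinct peaks the median peak is unique and is the only alternative that survives every pairwise majority contest.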

9.4.5 Gibbard-Satterthwaite Impossibility Theorem 

The task we set ourselves to accomplish in the previous sections was how to aggregate
profiles of individuals' preference relations into a rational social preference order.
Presumably, this social preference order is then used to make decisions. In this section we focus
directly on social decisions and pose the aggregation question as one of analyzing how profiles
of individual preferences turn into social decisions.

The main result we obtain again yields a dictatorship conclusion. The result amounts,
in a sense, to a translation of Arrow's impossibility theorem into the language of choice
functions. It provides a link towards the incentive-based analysis in the mechanism design
we will discuss in Part V.

Definition 9.4.8 A social choice function (SCF) is manipulable at $P \in \mathcal{P}^n$ if there exists
$P'_i \in \mathcal{P}_i$ such that

$$f(P_{-i}, P'_i) \; P_i \; f(P_{-i}, P_i) \qquad (9.9)$$

where $P_{-i} = (P_1, \ldots, P_{i-1}, P_{i+1}, \ldots, P_n)$.

Definition 9.4.9 A SCF is strongly individually incentive compatible (SIIC) if there
exists no preference ordering profile at which it is manipulable. In other words,
truth-telling is a dominant strategy equilibrium:

$$f(P'_{-i}, P_i) \; P_i \; f(P'_{-i}, P'_i) \quad \text{for all } P' \in \mathcal{P}^n \qquad (9.10)$$

Definition 9.4.10 A SCF is dictatorial if there exists an agent whose most preferred alternative is
always the social choice.

Theorem 9.4.2 (Gibbard-Satterthwaite Theorem) If X has at least 3 alternatives, 
a SCF which is SIIC and UD is dictatorial. 
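
A brute-force search makes the content of the theorem concrete. The sketch below is an illustration under assumptions not in the notes: three alternatives, two agents, a Borda-count SCF with ties broken in favor of the alphabetically first alternative, and a dictatorship of agent 0 for comparison. The function `find_manipulation` is a hypothetical helper that looks for a profile and a misreport satisfying (9.9).

```python
from itertools import permutations, product

ALTS = ("a", "b", "c")
ORDERINGS = list(permutations(ALTS))   # all strict orderings (unrestricted domain)

def borda(profile):
    """Borda winner; ties are broken toward the earlier alternative in ALTS."""
    scores = {x: 0 for x in ALTS}
    for ranking in profile:
        for pos, x in enumerate(ranking):
            scores[x] += len(ALTS) - 1 - pos
    return max(ALTS, key=lambda x: scores[x])   # max returns the first maximizer

def dictatorship(profile):
    """Agent 0's top-ranked alternative is always chosen."""
    return profile[0][0]

def find_manipulation(scf, n):
    """Return (profile, agent, lie, truthful outcome, new outcome) or None."""
    for profile in product(ORDERINGS, repeat=n):
        truthful = scf(profile)
        for i in range(n):
            for lie in ORDERINGS:
                deviated = profile[:i] + (lie,) + profile[i + 1:]
                outcome = scf(deviated)
                # Agent i gains if the new outcome ranks higher in his true ordering.
                if profile[i].index(outcome) < profile[i].index(truthful):
                    return profile, i, lie, truthful, outcome
    return None

print(find_manipulation(borda, 2))         # some profitable misreport exists
print(find_manipulation(dictatorship, 2))  # None: a dictatorship is not manipulable
```

For instance, at the profile where agent 0 reports a > b > c and agent 1 truly holds b > a > c, the tie between a and b is resolved in favor of a; agent 1 obtains b instead by reporting b > c > a.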

Proof. There are several ways to prove the Gibbard-Satterthwaite Impossibility
Theorem. The proof adopted here is due to Schmeidler-Sonnenschein (1978). It has the
advantage of bringing to light a close relationship to Arrow's Impossibility Theorem.

We want to prove that if such a mechanism is not manipulable, then $f$ is dictatorial. The
demonstration comprises two lemmas; it consists of starting from an implementable SCF
$f$ and of constructing from it an SWF $F$ that verifies Arrow's conditions. One concludes
from this that $F$ is dictatorial, which implies that $f$ also is.

Lemma 9.4.4 Suppose that $f(P) = a_1$ and $f(P'_i, P_{-i}) = a_2$, where $a_2 \neq a_1$. Then

1. $f$ is manipulable by $i$ in $(P'_i, P_{-i})$ if $a_1 P'_i a_2$

2. $f$ is manipulable by $i$ in $P$ if $a_2 P_i a_1$

Proof In both cases it is sufficient to write the definition of manipulability.

We will need the notation $P_i^j$, which, for a profile $P$ and given agents $i < j$, will
represent the vector $(P_i, \ldots, P_j)$.

Lemma 2 Let $B$ be a subset of the image of $f$ and $P$ a profile such that

• $\forall a_1 \in B$, $\forall a_2 \notin B$, $\forall i = 1, \ldots, n$, $a_1 P_i a_2$

Then $f(P) \in B$.

Proof This can be shown by contradiction. Let $a_2 = f(P)$, and suppose that $a_2 \notin B$.
Let $P'$ be a profile such that $f(P') = a_1 \in B$ (such a profile does exist, since $B$ is included
in the image of $f$ and given the universal domain hypothesis). Now construct a sequence
$(a_3^i)_{i=0,\ldots,n}$ by

• $a_3^0 = a_2 \notin B$

• for $i = 1, \ldots, n-1$, $a_3^i = f({P'}_1^{i}, P_{i+1}^{n})$

• $a_3^n = a_1 \in B$

Let $j$ be the first integer such that $a_3^j \in B$. We then get

• $f({P'}_1^{j}, P_{j+1}^{n}) = a_3^j \in B$

while $a_3^{j-1} \notin B$, and by the hypothesis of the lemma, $a_3^j P_j a_3^{j-1}$. Lemma 9.4.4 then implies that $f$ is
manipulable.

Now construct an SWF $F$. Let $P$ be any profile and $a_1, a_2$ two choices in $A$. Define a
new profile (using UD) $\tilde{P}$ such that for each $i$,

• $\tilde{P}_i$ coincides with $P_i$ on $\{a_1, a_2\}$

• $\tilde{P}_i$ coincides with $P_i$ on $A - \{a_1, a_2\}$

• $\{a_1, a_2\}$ is placed at the top of the preferences $\tilde{P}_i$

(Strictly speaking, $\tilde{P}$ of course depends on $a_1$ and $a_2$, and the notation should reflect
this.)

Lemma 2 implies that $f(\tilde{P}) \in \{a_1, a_2\}$ (taking $B = \{a_1, a_2\}$ and replacing $P$ by $\tilde{P}$ in
the statement of the lemma). $F$ can therefore be defined by

• $a_1 F(P) a_2 \Leftrightarrow f(\tilde{P}) = a_1$

Now we can verify Arrow's conditions. There are surely at least three choices.

• $F$ is, by construction, of universal domain.

• $F$ satisfies the Pareto principle: if for every $i$, $a_1 P_i a_2$, then $a_1$ is at the top of all
preferences $\tilde{P}_i$. By taking $B = \{a_1\}$ in the statement of Lemma 2, we indeed get
$f(\tilde{P}) = a_1$.

• $F$ satisfies IIA: if this were not the case, there would exist $P$, $P'$, $a_1$ and $a_2$ such
that

  • for every $i$, $a_1 P_i a_2 \Leftrightarrow a_1 P'_i a_2$

  • $a_1 F(P) a_2$ and $a_2 F(P') a_1$

Now define a sequence $(a_3^i)_{i=0,\ldots,n}$ by

• $a_3^0 = a_1$

• for $i = 1, \ldots, n-1$, $a_3^i = f({\tilde{P}'}_1^{i}, \tilde{P}_{i+1}^{n})$

• $a_3^n = a_2$

Lemma 2 implies that $a_3^i \in \{a_1, a_2\}$ for every $i$. Therefore let $j$ be the first integer
such that $a_3^j = a_2$. This gives $f({\tilde{P}'}_1^{j}, \tilde{P}_{j+1}^{n}) = a_2$ and $f({\tilde{P}'}_1^{j-1}, \tilde{P}_{j}^{n}) = a_1$. Now one of two
things can result:

• $a_1 P_j a_2$

This implies $a_1 P'_j a_2$ and therefore $a_1 \tilde{P}'_j a_2$, so Lemma 9.4.4 implies that $f$ is manipulable.

• $a_2 P_j a_1$

This implies $a_2 \tilde{P}_j a_1$, so Lemma 9.4.4 again implies that $f$ is manipulable.

There is a contradiction in both cases, so $F$ satisfies IIA. For every $P$, $F(P)$ is clearly a complete and
asymmetric binary relation. What remains for us is to verify that it is transitive.

Take the opposite case so that we have a cycle on a triplet $\{a_1, a_2, a_3\}$. For every $i$, let
$P'_i$, which coincides with $P_i$ on $\{a_1, a_2, a_3\}$ and on $A - \{a_1, a_2, a_3\}$, be such that $\{a_1, a_2, a_3\}$
is at the top of $P'_i$ (using UD). Lemma 2 implies that $f(P') \in \{a_1, a_2, a_3\}$; without any
loss of generality, we can assume that $f(P') = a_1$. Since $F(P)$ has a cycle on $\{a_1, a_2, a_3\}$,
we necessarily get $a_2 F(P) a_1$ or $a_3 F(P) a_1$. Here again, without loss of generality, we can
assume that $a_3 F(P) a_1$. Now modify $P'$ in $P''$ by making $a_2$ move into third place in each
individual preference ($P''$ is admissible by UD). Note that $a_3 P_i a_1$ if and only if $a_3 P''_i a_1$;
in applying IIA (which we have just shown is satisfied), we get $a_3 F(P'') a_1$, which again
implies $a_3 = f(P'')$.

At the risk of seeming redundant, we now define a sequence $(a_4^i)_{i=0,\ldots,n}$ by

• $a_4^0 = a_1$

• for $i = 1, \ldots, n-1$, $a_4^i = f({P''}_1^{i}, {P'}_{i+1}^{n})$

• $a_4^n = a_3$

Lemma 2 implies that $a_4^i \in \{a_1, a_2, a_3\}$ for every $i$. Therefore let $j$ be the first integer
such that $a_4^j \neq a_1$. One of two things results:

• $a_4^j = a_2$

but $a_1 P''_j a_2$ since $a_2$ is only in third position in $P''_j$. Therefore $f({P''}_1^{j-1}, {P'}_{j}^{n}) \; P''_j \; f({P''}_1^{j}, {P'}_{j+1}^{n})$,
so $f$ is manipulable.

• $a_4^j = a_3$

Now, if $a_1 P'_j a_3$, we also have $a_1 P''_j a_3$. Therefore $f({P''}_1^{j-1}, {P'}_{j}^{n}) \; P''_j \; f({P''}_1^{j}, {P'}_{j+1}^{n})$, and
$f$ is manipulable. If $a_3 P'_j a_1$, we directly get $f({P''}_1^{j}, {P'}_{j+1}^{n}) \; P'_j \; f({P''}_1^{j-1}, {P'}_{j}^{n})$, and $f$ is still
manipulable. We are led therefore to a contradiction in every case, which shows that
$F(P)$ is transitive.

Since $F$ verifies all of Arrow's conditions, $F$ must be dictatorial; let $i$ be the dictator.
Let $P$ be any profile and arrange the choices in such a way that $a_1 P_i a_2 P_i \cdots$. Since $i$ is the
dictator, more precisely we have $a_1 F(P) a_2$ and therefore $f(\tilde{P}) = a_1$. But, by construction,
$\tilde{P}$ coincides with $P$, and $f(P)$ is therefore $a_1$, the preferred choice of $i$, which concludes
the proof showing that $i$ is also a dictator for $f$.

Readers who are interested in other proofs are referred to Mas-Colell,
Whinston, and Green (1995).

Chapter 10

General Equilibrium Under 
Uncertainty 

10.1 Introduction 

In this chapter, we apply the general equilibrium framework developed in Chapters 7 to 9 
to economic situations involving the exchange and allocation of resources under conditions 
of uncertainty. 

We begin by formalizing uncertainty by means of states of the world and then introduce 
the key idea of a contingent commodity: a commodity whose delivery is conditional on 
the realized state of the world. We then use these tools to define the concept of an 
Arrow-Debreu equilibrium. This is simply a Walrasian equilibrium in which contingent 
commodities are traded. It follows from the general theory of Chapter 8 that an Arrow- 
Debreu equilibrium results in a Pareto optimal allocation of risk. 

In Section 10.4, we provide an important reinterpretation of the concept of
Arrow-Debreu equilibrium. We show that, under the assumptions of self-fulfilling, or rational,
expectations, Arrow-Debreu equilibria can be implemented by combining trade in a certain
restricted set of contingent commodities with spot trade that occurs after the resolution
of uncertainty. This results in a significant reduction in the number of ex ante (i.e., before
uncertainty) markets that must operate.

In Section 10.5 we briefly illustrate some of the welfare difficulties raised by the
possibility of incomplete markets, that is, by the possibility of there being too few asset
markets to guarantee a fully Pareto optimal allocation of risk.

10.2 A Market Economy with Contingent Commodities

As in our previous chapters, we contemplate an environment with L physical commodi- 
ties, n consumers, and J firms. The new element is that technologies, endowments, and 
preferences are now uncertain. 

Throughout this chapter, we represent uncertainty by assuming that technologies,
endowments, and preferences depend on the state of the world. A state of the world is
to be understood as a complete description of a possible outcome of uncertainty. We denote
the set of states by $S$. For simplicity we take $S$ to be a finite set with (abusing notation
slightly) $S$ elements. A typical element is denoted $s = 1, \ldots, S$.

We state the key concepts of a (state-) contingent commodity and a (state-) contingent 
commodity vector. Using these concepts we shall then be able to express the dependence 
of technologies, endowments, and preferences on the realized states of the world. 

Definition 10.2.1 For every physical commodity $\ell = 1, \ldots, L$ and state $s = 1, \ldots, S$, a
unit of (state-) contingent commodity $\ell s$ is a title to receive a unit of the physical good $\ell$
if and only if $s$ occurs. Accordingly, a (state-) contingent commodity vector is specified by

$$x = (x_{11}, \ldots, x_{L1}, \ldots, x_{1S}, \ldots, x_{LS}) \in \mathbb{R}^{LS},$$

and is understood as an entitlement to receive the commodity vector $(x_{1s}, \ldots, x_{Ls})$ if state
$s$ occurs.

We can also view a contingent commodity vector as a collection of $L$ random variables,
the $\ell$th random variable being $(x_{\ell 1}, \ldots, x_{\ell S})$.

With the help of the concept of contingent commodity vectors, we can now describe
how the characteristics of economic agents depend on the state of the world. To begin,
we let the endowments of consumer $i = 1, \ldots, n$ be a contingent commodity vector:

$$w_i = (w_{11i}, \ldots, w_{L1i}, \ldots, w_{1Si}, \ldots, w_{LSi}) \in \mathbb{R}^{LS}.$$

The meaning of this is that if state $s$ occurs then consumer $i$ has endowment vector
$(w_{1si}, \ldots, w_{Lsi}) \in \mathbb{R}^{L}$.

The preferences of consumer $i$ may also depend on the state of the world (e.g., the
consumer's enjoyment of wine may well depend on the state of his health). We represent this
dependence formally by defining the consumer's preferences over contingent commodity
vectors. That is, we let the preferences of consumer $i$ be specified by a rational preference
relation $\succsim_i$ defined on a consumption set $X_i \subset \mathbb{R}^{LS}$.

The consumer evaluates contingent commodity vectors by first assigning to state $s$ a
probability $\pi_{si}$ (which could have an objective or a subjective character), then evaluating
the physical commodity vectors at state $s$ according to a Bernoulli state-dependent
utility function $u_{si}(x_{1si}, \ldots, x_{Lsi})$, and finally computing the expected utility. That is, the
preferences of consumer $i$ over two contingent commodity vectors $x_i, x'_i \in X_i \subset \mathbb{R}^{LS}$ satisfy

$$x_i \succsim_i x'_i \text{ if and only if } \sum_s \pi_{si} u_{si}(x_{1si}, \ldots, x_{Lsi}) \geq \sum_s \pi_{si} u_{si}(x'_{1si}, \ldots, x'_{Lsi}).$$

It should be emphasized that the preferences $\succsim_i$ are in the nature of ex ante preferences:
the random variables describing possible consumptions are evaluated before the resolution
of uncertainty.
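
The expected-utility comparison of two contingent commodity vectors can be sketched as follows. All numbers and functional forms are illustrative assumptions, not taken from the notes: two states (healthy, sick), two goods (wine, bread), and square-root Bernoulli utilities in which wine matters less when the consumer is sick.

```python
import math

def expected_utility(probs, utilities, plan):
    """Sum over states s of probs[s] * utilities[s](plan[s])."""
    return sum(p * u(x) for p, u, x in zip(probs, utilities, plan))

# State-dependent Bernoulli utilities over the bundle x = (wine, bread).
def u_healthy(x):
    return math.sqrt(x[0]) + math.sqrt(x[1])

def u_sick(x):
    return 0.2 * math.sqrt(x[0]) + math.sqrt(x[1])   # wine is enjoyed less when sick

probs = (0.8, 0.2)                   # (healthy, sick), subjective probabilities
utilities = (u_healthy, u_sick)

x = ((4.0, 1.0), (4.0, 1.0))         # same physical bundle in both states
x_prime = ((4.0, 1.0), (0.0, 4.0))   # gives up wine for bread in the sick state

eu_x = expected_utility(probs, utilities, x)
eu_x_prime = expected_utility(probs, utilities, x_prime)
print(eu_x, eu_x_prime)
print("x' is weakly preferred to x:", eu_x_prime >= eu_x)
```

The comparison is made ex ante: both plans are evaluated before the state is known, even though the two plans deliver identical bundles in the healthy state.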

Similarly, the technological possibilities of firm $j$ are represented by a production set
$Y_j \subset \mathbb{R}^{LS}$. The interpretation is that a (state-) contingent production plan $y_j \in \mathbb{R}^{LS}$ is a
member of $Y_j$ if for every $s$ the input-output vector $(y_{1sj}, \ldots, y_{Lsj})$ of physical commodities
is feasible for firm $j$ when state $s$ occurs.

Example 10.2.1 Suppose there are two states, $s_1$ and $s_2$, representing good and bad
weather. There are two physical commodities: seeds ($\ell = 1$) and crops ($\ell = 2$). In
this case, the elements of $Y_j$ are four-dimensional vectors. Assume that seeds must be
planted before the resolution of the uncertainty about the weather and that a unit of
seeds produces a unit of crops if and only if the weather is good. Then

$$y_j = (y_{11j}, y_{21j}, y_{12j}, y_{22j}) = (-1, 1, -1, 0)$$

is a feasible plan. Note that since the weather is unknown when the seeds are planted, the
plan $(-1, 1, 0, 0)$ is not feasible: the seeds, if planted, are planted in both states. Thus, in
this manner we can embed into the structure of $Y_j$ constraints on production related to
the timing of the resolution of uncertainty.
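
The timing constraint in this example can be written as a simple membership test for $Y_j$. The sketch below is one possible formalization and involves assumptions beyond the text: free disposal of crops, and a hypothetical function name `is_feasible`.

```python
def is_feasible(plan, tol=1e-12):
    """Check a plan (y11, y21, y12, y22) for the seeds-and-crops technology.

    The order is (seeds, crops) in state 1 (good weather), then (seeds, crops)
    in state 2 (bad weather). Inputs are negative, outputs are positive.
    """
    seeds_good, crops_good, seeds_bad, crops_bad = plan
    # Seeds are planted before the state is known, so the input cannot vary by state.
    same_input = abs(seeds_good - seeds_bad) <= tol
    seeds_used = -seeds_good
    return (same_input
            and seeds_used >= 0
            and crops_good <= seeds_used + tol   # one unit of seeds yields one unit of crops
            and crops_bad <= tol)                # nothing grows in bad weather

print(is_feasible((-1, 1, -1, 0)))   # the feasible plan from the example
print(is_feasible((-1, 1, 0, 0)))    # infeasible: seeds planted in only one state
```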

To complete the description of a private market economy it only remains to specify
ownership shares for every consumer $i$ and firm $j$. In principle, these shares could also be
state-contingent. It will be simpler, however, to let $\theta_{ij} \geq 0$ be the share of firm $j$ owned
by consumer $i$ whatever the state. Of course $\sum_i \theta_{ij} = 1$ for every $j$.

A private market economy with uncertainty thus can be written

$$e = \left(S, \{X_i, \succsim_i, w_i\}_{i=1}^{n}, \{Y_j\}_{j=1}^{J}, \{\theta_{ij}\}_{i,j=1}^{n,J}\right).$$

Information and the Resolution of Uncertainty 

In the setting just described, time plays no explicit formal role. In reality, however,
states of the world unfold over time. Figure 10.1 captures the simplest example. In the
figure, we have a period 0 in which there is no information whatsoever on the true state
of the world and a period 1 in which this information has been completely revealed.

Figure 10.1: Two periods ($t = 0$ and $t = 1$). Perfect information at $t = 1$.

The same methodology can be used to incorporate into the formalism a much more
general temporal structure. Suppose we have $T + 1$ dates $t = 0, 1, \ldots, T$ and, as before,
$S$ states, but assume that the states emerge gradually through a tree, as in Figure 10.2.
Here final nodes stand for the possible states realized by time $t = T$, that is, for complete
histories of the uncertain environment. When the path through the tree coincides for two
states, $s$ and $s'$, up to time $t$, this means that in all periods up to and including period
$t$, $s$ and $s'$ cannot be distinguished.

Figure 10.2: An information tree: gradual release of information (dates $t = 0$, $t = 1$, $t = 2$).

Subsets of S are called events. A collection of events $\mathcal{L}$ is an information structure
if it is a partition, that is, if for every state s there is $E \in \mathcal{L}$ with $s \in E$ and for any two
$E, E' \in \mathcal{L}$, $E \ne E'$, we have $E \cap E' = \emptyset$. The interpretation is that if s and s' belong to
the same event in $\mathcal{L}$ then s and s' cannot be distinguished in the information structure $\mathcal{L}$.

To capture formally a situation with sequential revelation of information we look
at a family of information structures: $(\mathcal{L}_0, \ldots, \mathcal{L}_t, \ldots, \mathcal{L}_T)$. The process of information
revelation makes the $\mathcal{L}_t$ increasingly fine: once one has information sufficient to distinguish
between two states, the information is not forgotten.

Example 10.2.2 Consider the tree in Figure 10.2. We have

$\mathcal{L}_0 = (\{1, 2, 3, 4, 5, 6\})$,

$\mathcal{L}_1 = (\{1, 2\}, \{3\}, \{4, 5, 6\})$,

$\mathcal{L}_2 = (\{1\}, \{2\}, \{3\}, \{4\}, \{5\}, \{6\})$.
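
The two requirements on a family of information structures (each member is a partition of S, and later members are finer than earlier ones) can be checked mechanically. The following is a minimal sketch using the three partitions of Example 10.2.2; the function names `is_partition` and `refines` are illustrative choices, not notation from the notes.

```python
def is_partition(events, states):
    """True if the events are nonempty, pairwise disjoint and cover `states`."""
    seen = set()
    for event in events:
        if not event or seen & event:
            return False
        seen |= event
    return seen == states


def refines(finer, coarser):
    """True if every event of `finer` lies inside some event of `coarser`."""
    return all(any(e <= c for c in coarser) for e in finer)


states = {1, 2, 3, 4, 5, 6}
L0 = [{1, 2, 3, 4, 5, 6}]
L1 = [{1, 2}, {3}, {4, 5, 6}]
L2 = [{1}, {2}, {3}, {4}, {5}, {6}]
family = [L0, L1, L2]

# Every member is a partition, and information is never forgotten.
all_partitions = all(is_partition(L, states) for L in family)
increasingly_fine = all(
    refines(family[t + 1], family[t]) for t in range(len(family) - 1)
)
print(all_partitions, increasingly_fine)
```

Reversing the order of the family would fail the second check, since a coarser partition cannot refine a finer one.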

The partitions could in principle be different across individuals. However, we shall 
assume that the information structure is the same for all consumers. 

A pair (t, E) where t is a date and $E \in \mathcal{L}_t$ is called a date-event. Date-events are
associated with the nodes of the tree. Each date-event except the first has a unique 
predecessor, and each date-event not at the end of the tree has one or more successors. 

With this temporal modeling it is now necessary to be explicit about the time at
which a physical commodity is available. Suppose there is a number H of basic physical
commodities (bread, leisure, etc.). We will use the double index ht to indicate the time at
which a commodity h is produced, appears as endowment, or is available for consumption.
Then $x_{hts}$ stands for an amount of the physical commodity h available at time t along the
path of state s.

Fortunately, this multi-period model can be formally reduced to the timeless structure
introduced above. To see this, we define a new set of L = H(T + 1) physical commodities,
each of them being one of these double-indexed (i.e., ht) commodities. We then say
that a vector $z \in \mathbb{R}^{LS}$ is measurable with respect to the family of information partitions
$(\mathcal{L}_0, \ldots, \mathcal{L}_T)$ if, for every hts and hts', we have that $z_{hts} = z_{hts'}$ whenever s, s' belong to
the same element of the partition $\mathcal{L}_t$. That is, whenever s and s' cannot be distinguished
at time t, the amounts assigned to the two states cannot be different. Finally, we impose
on endowments $w_i \in \mathbb{R}^{LS}$, consumption sets $X_i \subset \mathbb{R}^{LS}$ and production sets $Y_j \subset \mathbb{R}^{LS}$
the restriction that all their elements be measurable with respect to the family of
information partitions. With this, we have reduced the multi-period structure to our original
formulation.
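
The measurability restriction can be illustrated with a short check. This is only a sketch under stated assumptions: a vector is stored as a dictionary keyed by (h, t, s), the partitions are those of Example 10.2.2, and the names `event_of` and `is_measurable` as well as the numbers are hypothetical.

```python
def event_of(partition, s):
    """Return the event of `partition` that contains state s."""
    return next(frozenset(e) for e in partition if s in e)


def is_measurable(z, family):
    """z maps (h, t, s) to an amount; family[t] is the partition at date t."""
    for (h, t, s), amount in z.items():
        for (h2, t2, s2), amount2 in z.items():
            same_good_and_date = (h, t) == (h2, t2)
            same_event = same_good_and_date and (
                event_of(family[t], s) == event_of(family[t], s2)
            )
            # Indistinguishable states at date t must carry equal amounts.
            if same_event and amount != amount2:
                return False
    return True


family = [
    [{1, 2, 3, 4, 5, 6}],
    [{1, 2}, {3}, {4, 5, 6}],
    [{1}, {2}, {3}, {4}, {5}, {6}],
]

# One physical good (h = 0), dates 0, 1, 2, states 1..6.
z_ok = {}
for s in range(1, 7):
    z_ok[(0, 0, s)] = 1.0                                   # no information yet
    z_ok[(0, 1, s)] = {1: 2.0, 2: 2.0, 3: 0.5}.get(s, 3.0)  # constant on each event
    z_ok[(0, 2, s)] = float(s)                              # full information

z_bad = dict(z_ok)
z_bad[(0, 1, 2)] = 9.0  # differs from state 1, yet 1 and 2 are indistinguishable at t = 1

print(is_measurable(z_ok, family), is_measurable(z_bad, family))
```

The second vector fails because it would require the agent to act differently in states 1 and 2 at a date when the two states cannot yet be told apart.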

10.3 Arrow-Debreu Equilibrium 

We postulate the existence of a market for every contingent commodity ls. These markets
open before the resolution of uncertainty. The price of the commodity is denoted by $p_{ls}$.
Notice that for this market to be well defined it is indispensable that all economic agents 
be able to recognize the occurrence of s. That is, information should be symmetric across 
economic agents. 

Formally, the market economy just described is nothing but a particular case of the 
economies we have studied in previous chapters. We can, therefore, apply to our market 
economy the concept of Walrasian equilibrium and, with it, all the theory developed so 
far. When dealing with contingent commodities it is customary to call the Walrasian 
equilibrium an Arrow-Debreu equilibrium. 

Definition 10.3.1 An allocation

$(x_1^*, \ldots, x_n^*, y_1^*, \ldots, y_J^*) \in X_1 \times \cdots \times X_n \times Y_1 \times \cdots \times Y_J \subset \mathbb{R}^{LS(n+J)}$

and a system of prices for the contingent commodities $p = (p_{11}, \ldots, p_{LS}) \in \mathbb{R}^{LS}$ constitute
an Arrow-Debreu equilibrium if:

(i) For every j, $y_j^*$ satisfies $p \cdot y_j^* \ge p \cdot y_j$ for all $y_j \in Y_j$.

(ii) For every i, $x_i^*$ is maximal for $\succsim_i$ in the budget set
$\{x_i \in X_i : p \cdot x_i \le p \cdot w_i + \sum_j \theta_{ij}\, p \cdot y_j^*\}$.

(iii) $\sum_i x_i^* = \sum_j y_j^* + \sum_i w_i$.

The positive and welfare theorems of Chapters 7 and 8 apply without modification 
to the Arrow-Debreu equilibrium. Recall that, in the present context, the convexity 
assumption takes on an interpretation in terms of risk aversion. For example, in the
expected utility setting, the preference relation $\succsim_i$ is convex if the Bernoulli utilities
$u_{si}(x_{si})$ are concave.

The Pareto optimality implication of Arrow-Debreu equilibrium says, effectively, that 
the possibility of trading in contingent commodities leads, at equilibrium, to an efficient 
allocation of risk. 

It is important to realize that at any production plan the profit of a firm, $p \cdot y_j$, is a
nonrandom amount of dollars. Productions and deliveries of goods do, of course, depend 
on the state of the world, but the firm is active in all the contingent markets and manages, 
so to speak, to insure completely. This has important implications for the justification of 
profit maximization as the objective of the firm. 

Example 10.3.1 Consider an exchange economy with n = 2, L = 1, and S = 2. This
lends itself to an Edgeworth box representation because there are precisely two contingent
commodities. In Figures 10.3(a) and 10.3(b) we have $w_1 = (1, 0)$, $w_2 = (0, 1)$, and utility
functions of the form $\pi_{1i} u_i(x_{1i}) + \pi_{2i} u_i(x_{2i})$, where $(\pi_{1i}, \pi_{2i})$ are the subjective probabilities
of consumer i for the two states. Since $w_1 + w_2 = (1, 1)$ there is no aggregate uncertainty,
and the state of the world determines only which consumer receives the endowment of
the consumption good. Recall that for this model [in which the $u_i(\cdot)$ do not depend on
s], the marginal rate of substitution of consumer i at any point where the consumption
is the same in the two states equals the probability ratio $\pi_{1i}/\pi_{2i}$.

In Figure 10.3(a) the subjective probabilities are the same for the two consumers (i.e.,
$\pi_{11} = \pi_{12}$) and therefore the Pareto set coincides with the diagonal of the box (the box is
a square and so the diagonal coincides with the 45-degree line, where the marginal rates of
substitution for the two consumers are equal: $\pi_{11}/\pi_{21} = \pi_{12}/\pi_{22}$). Hence, at equilibrium,
the two consumers insure completely; that is, consumer i's equilibrium consumption does
not vary across the two states. In Figure 10.3(b) the consumers' subjective probabilities
are different. In particular, $\pi_{11} < \pi_{12}$ (i.e., the second consumer gives more probability to
state 1). In this case, each consumer's equilibrium consumption is higher in the state he
thinks comparatively more likely (relative to the beliefs of the other consumer).

Figure 10.3: (a) No aggregate risk: same probability assessments. (b) No aggregate risk:
different probability assessments.
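
Both cases of Example 10.3.1 can be reproduced numerically once a specific utility is chosen. The sketch below assumes $u_i(x) = \ln x$, which is not specified in the notes; under that assumption demands take the Cobb-Douglas form $x_{si} = \pi_{si}\,(p \cdot w_i)/p_s$ and the equilibrium price follows in closed form from market clearing in state 1. The function name `arrow_debreu_log` is illustrative.

```python
def arrow_debreu_log(pi1, pi2):
    """Arrow-Debreu equilibrium for w1 = (1, 0), w2 = (0, 1) and log utility.

    pi1, pi2: probability that consumers 1 and 2 assign to state 1.
    Returns the price of state-1 consumption (state-2 price normalised to 1)
    and each consumer's consumption as a (state 1, state 2) pair.
    """
    # Market clearing in state 1: pi1 + pi2 / p1 = 1.
    p1 = pi2 / (1.0 - pi1)
    wealth1, wealth2 = p1 * 1.0, 1.0          # value of each endowment
    x1 = (pi1 * wealth1 / p1, (1.0 - pi1) * wealth1)
    x2 = (pi2 * wealth2 / p1, (1.0 - pi2) * wealth2)
    return p1, x1, x2


# Case (a): common beliefs, so both consumers insure completely.
p_same, x1_same, x2_same = arrow_debreu_log(0.5, 0.5)

# Case (b): consumer 2 thinks state 1 more likely than consumer 1 does.
p_diff, x1_diff, x2_diff = arrow_debreu_log(0.3, 0.6)

print(x1_same, x2_same)
print(x1_diff, x2_diff)
```

In case (b) consumer 1 consumes more in state 2 and consumer 2 more in state 1, that is, each consumes more in the state he considers comparatively more likely.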

10.4 Sequential Trade 

The Arrow-Debreu framework provides a remarkable illustration of the power of general 
equilibrium theory. Yet, it is hardly realistic. Indeed, at an Arrow-Debreu equilibrium 
all trade takes place simultaneously and before the uncertainty is resolved. Trade is a 
one-shot affair. In reality, however, trade takes place to a large extent sequentially over 
time, and frequently as a consequence of information disclosures. The aim of this section 
is to introduce a first model of sequential trade and show that Arrow-Debreu equilibria 
can be reinterpreted by means of trading processes that actually unfold through time. 

To be as simple as possible we consider only exchange economies. In addition, we take
$X_i = \mathbb{R}^{LS}_+$ for every i. To begin with, we assume that there are two dates, t = 0 and
t = 1, that there is no information whatsoever at t = 0, and that the uncertainty has
resolved completely at t = 1. Thus, the date-event tree is as in Figure 10.1. Again for
simplicity, we assume that there is no consumption at t = 0.

Suppose that markets for the LS possible contingent commodities are set up at t = 0,
and that $(x_1^*, \ldots, x_n^*) \in \mathbb{R}^{LSn}$ is an Arrow-Debreu equilibrium allocation with prices
$(p_{11}, \ldots, p_{LS}) \in \mathbb{R}^{LS}$. Recall that these markets are for delivery of goods at t = 1 (they
are commonly called forward markets). When period t = 1 arrives, a state of the world s
is revealed, contracts are executed, and every consumer i receives
$x_{si}^* = (x_{1si}^*, \ldots, x_{Lsi}^*) \in \mathbb{R}^L$. Imagine now that, after this but before the actual consumption
of $x_{si}^*$, markets for the L physical goods were to open at t = 1 (these are called spot
markets). Would there be any incentive to trade in these markets? The answer is "no." To
see why, suppose that there were potential gains from trade among the consumers, that is,
that there were $x_{si} = (x_{1si}, \ldots, x_{Lsi}) \in \mathbb{R}^L$ for i = 1, ..., n, such that $\sum_i x_{si} \le \sum_i w_{si}$ and
$(x_{1i}^*, \ldots, x_{si}, \ldots, x_{Si}^*) \succsim_i (x_{1i}^*, \ldots, x_{si}^*, \ldots, x_{Si}^*)$ for all i, with at least one preference strict.
It then follows from the definition of Pareto optimality that the Arrow-Debreu equilibrium
allocation $(x_1^*, \ldots, x_n^*) \in \mathbb{R}^{LSn}$ is not Pareto optimal, contradicting the conclusion of the
first welfare theorem. In summary, at t = 0 the consumers can trade directly to an overall
Pareto optimal allocation; hence there is no reason for further trade to take place. In
other words, ex ante Pareto optimality implies ex post Pareto optimality and thus no ex
post trade.

Matters are different if not all the LS contingent commodity markets are available
at t = 0. Then the initial trade to a Pareto optimal allocation may not be feasible
and it is quite possible that ex post (i.e., after the revelation of the state s) the resulting 
consumption allocation is not Pareto optimal. There would then be an incentive to reopen 
the markets and retrade. 

A most interesting possibility, first observed by Arrow (1953), is that, even if not all
the contingent commodities are available at t = 0, it may still be the case under some
conditions that the retrading possibilities at t = 1 guarantee that Pareto optimality is
reached, nevertheless. That is, the possibility of ex post trade can make up for an absence
of some ex ante markets. In what follows, we shall verify that this is the case whenever
at least one physical commodity can be traded contingently at t = 0 if, in addition, spot
markets occur at t = 1 and the spot equilibrium prices are correctly anticipated at t = 0.
The intuition for this result is reasonably straightforward: if spot trade can occur within
each state, then the only task remaining at t = 0 is to transfer the consumer's overall
purchasing power efficiently across states. This can be accomplished using contingent
trade in a single commodity. By such a procedure we are able to reduce the number of
required forward markets from LS to S.

Let us be more specific. At t = 0 consumers have expectations regarding the spot
prices prevailing at t = 1 for each possible state $s \in S$. Denote the price vector expected
to prevail in the state s spot market by $p_s \in \mathbb{R}^L$, and the overall expectation vector by
$p = (p_1, \ldots, p_S) \in \mathbb{R}^{LS}$. Suppose that, in addition, at date t = 0 there is trade in the
S contingent commodities denoted by 11 to 1S; that is, there is contingent trade only in
the physical good with the label 1. We denote the vector of prices for these contingent
commodities traded at t = 0 by $q = (q_1, \ldots, q_S) \in \mathbb{R}^S$.

Faced with prices $q \in \mathbb{R}^S$ at t = 0 and expected spot prices $(p_1, \ldots, p_S) \in \mathbb{R}^{LS}$ at
t = 1, every consumer i formulates a consumption, or trading, plan $(z_{1i}, \ldots, z_{Si}) \in \mathbb{R}^S$
for contingent commodities at t = 0, as well as a set of spot market consumption plans
$(x_{1i}, \ldots, x_{Si}) \in \mathbb{R}^{LS}$ for the different states that may occur at t = 1. Of course, these
plans must satisfy a budget constraint. Let $U_i(\cdot)$ be a utility function for $\succsim_i$. Then the
problem of consumer i can be expressed formally as

$$\max_{(x_{1i}, \ldots, x_{Si}) \in \mathbb{R}^{LS}_+,\ (z_{1i}, \ldots, z_{Si}) \in \mathbb{R}^S} U_i(x_{1i}, \ldots, x_{Si}) \qquad (10.1)$$

s.t. (i) $\sum_s q_s z_{si} \le 0$,

(ii) $p_s \cdot x_{si} \le p_s \cdot w_{si} + p_{1s} z_{si}$ for every s.

Restriction (i) is the budget constraint corresponding to trade at t = 0. The family
of restrictions (ii) are the budget constraints for the different spot markets. Note that
the value of wealth at a state s is composed of two parts: the market value of the initial
endowments, $p_s \cdot w_{si}$, and the market value of the amounts $z_{si}$ of good 1 bought or sold
forward at t = 0. Observe that we are not imposing any restriction on the sign or the
magnitude of $z_{si}$. If $z_{si} < -w_{1si}$ then one says that at t = 0 consumer i is selling good 1
short. This is because he is selling at t = 0, contingent on state s occurring, more than he
has at t = 1 if s occurs. Hence, if s occurs he will actually have to buy in the spot market
the extra amount of the first good required for the fulfillment of his commitments. The
possibility of selling short is, however, indirectly limited by the fact that consumption,
and therefore ex post wealth, must be nonnegative for every s.¹

To define an appropriate notion of sequential trade we shall impose a key condition:
Consumers' expectations must be self-fulfilled, or rational; that is, we require that
consumers' expectations of the prices that will clear the spot markets for the different states
s do actually clear them once date t = 1 has arrived and a state s is revealed.

Definition 10.4.1 A collection formed by a price vector $q = (q_1, \ldots, q_S) \in \mathbb{R}^S$ for
contingent first good commodities at t = 0, a spot price vector

$p_s = (p_{1s}, \ldots, p_{Ls}) \in \mathbb{R}^L$

for every s, and, for every consumer i, consumption plans $z_i^* = (z_{1i}^*, \ldots, z_{Si}^*) \in \mathbb{R}^S$ at
t = 0 and $x_i^* = (x_{1i}^*, \ldots, x_{Si}^*) \in \mathbb{R}^{LS}$ at t = 1 constitutes a Radner equilibrium [see Radner
(1982)] if:

(i) For every i, the consumption plans $z_i^*, x_i^*$ solve problem (10.1).

(ii) $\sum_i z_{si}^* \le 0$ and $\sum_i x_{si}^* \le \sum_i w_{si}$ for every s.

At a Radner equilibrium, trade takes place through time and, in contrast to the Arrow- 
Debreu setting, economic agents face a sequence of budget sets, one at each date-state 
(more generally, at every date-event). 

We can see from an examination of problem (10.1) that all the budget constraints
are homogeneous of degree zero with respect to prices. It is natural to choose the first
commodity as the numeraire and to put $p_{1s} = 1$ for every s, so that a unit of the s
contingent commodity then pays off 1 dollar in state s. Note that this still leaves one degree
of freedom, that corresponding to the forward trades at date 0 (so we could put $q_1 = 1$, or
perhaps $\sum_s q_s = 1$).

¹ Observe also that we have taken the wealth at t = 0 to be zero (that is, there are no initial endowments
of the contingent commodities). This is simply a convention. Suppose, for example, that we regard $w_{1si}$,
the amount of good 1 available at t = 1 in state s, as the amount of the s contingent commodity that i
owns at t = 0 (to avoid double counting, the initial endowment of commodity 1 in the spot market s at
t = 1 should simultaneously be put to zero). The budget constraints are then: (i) $\sum_s q_s (z'_{si} - w_{1si}) \le 0$
and (ii) $p_s \cdot x_{si} \le \sum_{l \ne 1} p_{ls} w_{lsi} + p_{1s} z'_{si}$, for every s. But letting $z'_{si} = z_{si} + w_{1si}$, we see that these are
exactly the constraints of (10.1).

In the following proposition, which is the key result of this section, we show that for
this model the set of Arrow-Debreu equilibrium allocations (induced by the arrangement
of one-shot trade in LS contingent commodities) and the set of Radner equilibrium
allocations (induced by contingent trade in only one commodity, sequentially followed by
spot trade) are identical.

Proposition 10.4.1 We have:

(i) If the allocation $x^* \in \mathbb{R}^{LSn}$ and the contingent commodities price vector
$(p_1, \ldots, p_S) \in \mathbb{R}^{LS}_{++}$ constitute an Arrow-Debreu equilibrium, then there
are prices $q \in \mathbb{R}^S_{++}$ for contingent first good commodities and consumption
plans for these commodities $z^* = (z_1^*, \ldots, z_n^*) \in \mathbb{R}^{Sn}$ such that the
consumption plans $x^*, z^*$, the prices q, and the spot prices $(p_1, \ldots, p_S)$
constitute a Radner equilibrium.

(ii) Conversely, if the consumption plans $x^* \in \mathbb{R}^{LSn}$, $z^* \in \mathbb{R}^{Sn}$ and prices
$q \in \mathbb{R}^S_{++}$, $(p_1, \ldots, p_S) \in \mathbb{R}^{LS}_{++}$ constitute a Radner equilibrium, then there
are multipliers $(\mu_1, \ldots, \mu_S) \in \mathbb{R}^S_{++}$ such that the allocation $x^*$ and the
contingent commodities price vector $(\mu_1 p_1, \ldots, \mu_S p_S) \in \mathbb{R}^{LS}_{++}$ constitute an
Arrow-Debreu equilibrium. (The multiplier $\mu_s$ is interpreted as the value,
at t = 0, of a dollar at t = 1 and state s.)

Proof:

i. It is natural to let $q_s = p_{1s}$ for every s. With this we claim that, for every consumer
i, the budget set of the Arrow-Debreu problem,

$B_i^{AD} = \{(x_{1i}, \ldots, x_{Si}) \in \mathbb{R}^{LS}_+ : \sum_s p_s \cdot (x_{si} - w_{si}) \le 0\}$,

is identical to the budget set of the Radner problem,

$B_i^{R} = \{(x_{1i}, \ldots, x_{Si}) \in \mathbb{R}^{LS}_+$ : there are $(z_{1i}, \ldots, z_{Si})$
such that $\sum_s q_s z_{si} \le 0$ and $p_s \cdot (x_{si} - w_{si}) \le p_{1s} z_{si}$ for every s$\}$.

To see this, suppose that $x_i = (x_{1i}, \ldots, x_{Si}) \in B_i^{AD}$. For every s, denote
$z_{si} = (1/p_{1s})\, p_s \cdot (x_{si} - w_{si})$. Then $\sum_s q_s z_{si} = \sum_s p_{1s} z_{si} = \sum_s p_s \cdot (x_{si} - w_{si}) \le 0$ and
$p_s \cdot (x_{si} - w_{si}) = p_{1s} z_{si}$ for every s. Hence, $x_i \in B_i^{R}$. Conversely, suppose that
$x_i = (x_{1i}, \ldots, x_{Si}) \in B_i^{R}$; that is, for some $(z_{1i}, \ldots, z_{Si})$ we have $\sum_s q_s z_{si} \le 0$ and
$p_s \cdot (x_{si} - w_{si}) \le p_{1s} z_{si}$ for every s. Summing over s gives
$\sum_s p_s \cdot (x_{si} - w_{si}) \le \sum_s p_{1s} z_{si} = \sum_s q_s z_{si} \le 0$. Hence, $x_i \in B_i^{AD}$.

We conclude that our Arrow-Debreu equilibrium allocation is also a Radner equilibrium
allocation supported by $q = (p_{11}, \ldots, p_{1S}) \in \mathbb{R}^S$, the spot prices $(p_1, \ldots, p_S)$,
and the contingent trades $(z_{1i}^*, \ldots, z_{Si}^*) \in \mathbb{R}^S$ defined by $z_{si}^* = (1/p_{1s})\, p_s \cdot (x_{si}^* - w_{si})$.
Note that the contingent markets clear since, for every s,
$\sum_i z_{si}^* = (1/p_{1s})\, p_s \cdot \left[\sum_i (x_{si}^* - w_{si})\right] \le 0$.

ii. Choose $\mu_s$ so that $\mu_s p_{1s} = q_s$. Then we can rewrite the Radner budget set of
every consumer i as $B_i^{R} = \{(x_{1i}, \ldots, x_{Si}) \in \mathbb{R}^{LS}_+$ : there are $(z_{1i}, \ldots, z_{Si})$ such that
$\sum_s q_s z_{si} \le 0$ and $\mu_s p_s \cdot (x_{si} - w_{si}) \le q_s z_{si}$ for every s$\}$. But from this we can proceed
as we did in part (i) and rewrite the constraints, and therefore the budget set, in
the Arrow-Debreu form:

$B_i^{R} = B_i^{AD} = \{(x_{1i}, \ldots, x_{Si}) \in \mathbb{R}^{LS}_+ : \sum_s \mu_s p_s \cdot (x_{si} - w_{si}) \le 0\}$.

Hence, the consumption plan $x_i^*$ is also preference maximizing in the budget set
$B_i^{AD}$. Since this is true for every consumer i, we conclude that the price vector
$(\mu_1 p_1, \ldots, \mu_S p_S) \in \mathbb{R}^{LS}$ clears the markets for the LS contingent commodities.
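
The construction in part (i) of the proof can be traced on numbers. The sketch below uses hypothetical prices, endowments and a consumption plan for a single consumer with S = 2 and L = 2; it builds the contingent trades $z_s = (1/p_{1s})\, p_s \cdot (x_s - w_s)$ with $q_s = p_{1s}$ and compares the Arrow-Debreu budget value with the value of the forward trades. The helper names `dot`, `net_value` and `radner_trades` are illustrative.

```python
def dot(a, b):
    """Inner product of two equal-length lists."""
    return sum(ai * bi for ai, bi in zip(a, b))


def net_value(ps, xs, ws):
    """p_s . (x_s - w_s): value of net purchases in state s."""
    return dot(ps, [xl - wl for xl, wl in zip(xs, ws)])


def radner_trades(p, x, w):
    """Contingent trades z_s = (1 / p_1s) * p_s . (x_s - w_s), one per state."""
    return [net_value(ps, xs, ws) / ps[0] for ps, xs, ws in zip(p, x, w)]


# Hypothetical data: one list per state, one entry per good.
p = [[1.0, 2.0], [0.5, 1.0]]   # Arrow-Debreu prices p_s
w = [[4.0, 1.0], [0.0, 2.0]]   # endowments w_s
x = [[2.0, 1.5], [1.0, 2.5]]   # a consumption plan x_s

q = [ps[0] for ps in p]        # forward prices q_s = p_1s
z = radner_trades(p, x, w)

ad_value = sum(net_value(ps, xs, ws) for ps, xs, ws in zip(p, x, w))
forward_value = dot(q, z)
spot_gaps = [net_value(ps, xs, ws) - ps[0] * zs
             for ps, xs, ws, zs in zip(p, x, w, z)]

print(z, ad_value, forward_value, spot_gaps)
```

The plan is affordable at Arrow-Debreu prices exactly when the forward trades cost nothing at t = 0, and each spot budget constraint then holds with equality, which is the identity between the two budget sets used in the proof.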

Example 10.4.1: Consider a two-good, two-state, two-consumer pure exchange economy.
Suppose that the two states are equally likely and that every consumer has the same,
state-independent, Bernoulli utility function $u(x_{si})$. The consumers differ only in their
initial endowments. The aggregate endowment vectors in the two states are the same;
however, endowments are distributed so that consumer 1 gets everything in state 1 and
consumer 2 gets everything in state 2. (See Figure 10.4.)

Figure 10.4: Reaching the Arrow-Debreu equilibrium by means of contingent trade in the
first good only.

By the symmetry of the problem, at an Arrow-Debreu equilibrium each consumer gets,
in each state, half of the total endowment of each good. In Figure 10.4, we indicate how
these consumptions will be reached by means of contingent trade in the first commodity
and spot markets. The spot prices will be the same in the two states. The first consumer
will sell an amount $\alpha$ of the first good contingent on the occurrence of the first state and
will in exchange buy an amount $\beta$ of the same good contingent on the second state.
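
To make the amounts sold and bought forward concrete, the sketch below assumes the Bernoulli utility $u(x_1, x_2) = \ln x_1 + \ln x_2$ and an aggregate endowment of (2, 4); both are hypothetical choices, since the notes leave $u$ and the endowment unspecified. With this utility the spot price ratio at the equal-split allocation is $p_1/p_2 = x_2/x_1$, and consumer 1's forward trades follow from $z_{s1} = p_s \cdot (x_{s1}^* - w_{s1})/p_{1s}$.

```python
total = (2.0, 4.0)                     # hypothetical aggregate endowment of goods 1 and 2
x_star = (total[0] / 2, total[1] / 2)  # each consumer's consumption in every state

# Spot prices with good 1 as numeraire; the same in both states.
# For u = ln x1 + ln x2 the marginal rate of substitution is x2 / x1.
p_spot = (1.0, x_star[0] / x_star[1])

# Consumer 1 owns everything in state 1 and nothing in state 2.
w1 = {1: total, 2: (0.0, 0.0)}


def forward_trade(w_s):
    """z_s1 = p_s . (x*_s1 - w_s1) / p_1s for consumer 1."""
    value = sum(p * (x - w) for p, x, w in zip(p_spot, x_star, w_s))
    return value / p_spot[0]


z = {s: forward_trade(w1[s]) for s in (1, 2)}
alpha, beta = -z[1], z[2]   # amount sold in state 1, amount bought in state 2
print(alpha, beta)
```

Because the two states are equally likely and there is no aggregate risk, the two forward prices coincide, so the budget constraint at t = 0 forces the two amounts to be equal in this sketch.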

Remark 10.4.1 It is important to emphasize that, although the concept of Radner
equilibrium cuts down the number of contingent commodities required to attain optimality
(from LS to S), this reduction is not obtained free of charge. With the smaller number 
of forward contracts, the correct anticipation of future spot prices becomes crucial. 

10.5 Incomplete Markets 

In this section we explore the implications of having fewer than S assets, that is, of having 
an asset structure that is necessarily incomplete. We pursue this point in the two-period 
framework of the previous sections. 

We begin by observing that when K < S a Radner equilibrium need not be Pareto
optimal. This is not surprising: if the possibilities of transferring wealth across states are
limited, then there may well be a welfare loss due to the inability to diversify risks to
the extent that would be convenient. Just consider the extreme case where there are no
assets at all. The following example provides another interesting illustration of this type
of failure.



Example 10.5.1 [Sunspots]. Suppose that preferences admit an expected utility
representation and that the set of states S is such that, first, the probability estimates for the
different states are the same across consumers (i.e., $\pi_{si} = \pi_{si'} = \pi_s$ for all i, i', and s)
and second, that the states do not affect the fundamentals of the economy; that is, the
Bernoulli utility functions and the endowments of every consumer i are uniform across
states [i.e., $u_{si}(\cdot) = u_i(\cdot)$ and $w_{si} = w_i$ for all s]. Such a set of states is called a sunspot
set. The question we shall address is whether in these circumstances the Radner
equilibrium allocations can assign varying consumptions across states. An equilibrium where
this happens is called a sunspot equilibrium.

Under the assumption that consumers are strictly risk averse, so that the utility
functions $u_i(\cdot)$ are strictly concave, any Pareto optimal allocation $(x_1, \ldots, x_n) \in \mathbb{R}^{LSn}_+$
must be uniform across states (or state independent); that is, for every i we must have
$x_{1i} = x_{2i} = \cdots = x_{si} = \cdots = x_{Si}$. To see this, suppose that, for every i and s, we replace
the consumption bundle of consumer i in state s, $x_{si} \in \mathbb{R}^L_+$, by the expected consumption
bundle of this consumer: $\bar{x}_i = \sum_s \pi_s x_{si} \in \mathbb{R}^L_+$. The new allocation is state independent,
and it is also feasible because

$$\sum_i \bar{x}_i = \sum_i \sum_s \pi_s x_{si} = \sum_s \pi_s \Big(\sum_i x_{si}\Big) \le \sum_s \pi_s \Big(\sum_i w_i\Big) = \sum_i w_i.$$

By the concavity of $u_i(\cdot)$ it follows that no consumer is worse off:

$$\sum_s \pi_s u_i(\bar{x}_i) = u_i(\bar{x}_i) = u_i\Big(\sum_s \pi_s x_{si}\Big) \ge \sum_s \pi_s u_i(x_{si}) \quad \text{for every } i.$$

Because of the Pareto optimality of $(x_1, \ldots, x_n)$, the above weak inequalities must in
fact be equalities; that is, $u_i(\bar{x}_i) = \sum_s \pi_s u_i(x_{si})$ for every i. But, if so, then the strict
concavity of $u_i(\cdot)$ yields $x_{si} = \bar{x}_i$ for every s. In summary: the Pareto optimal allocation
$(x_1, \ldots, x_n) \in \mathbb{R}^{LSn}$ is state independent.
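
The averaging argument can be illustrated with a small numerical sketch. It assumes one good, two consumers, two equally likely sunspot states and the strictly concave utility $u(x) = \sqrt{x}$; all of these numbers are hypothetical and serve only to show the two steps of the argument (feasibility of the averaged allocation and the utility comparison).

```python
import math

pi = [0.5, 0.5]                 # common probabilities of the two sunspot states
w = [1.0, 1.0]                  # state-independent endowments of consumers 1 and 2
x = [[1.5, 0.5], [0.5, 1.5]]    # x[i][s]: a feasible but state-dependent allocation
u = math.sqrt                   # strictly concave Bernoulli utility (assumed)

# Replace each consumer's bundle by its expected value across states.
x_bar = [sum(p * xs for p, xs in zip(pi, xi)) for xi in x]

# Feasibility: the averaged allocation uses no more than the total endowment.
feasible = sum(x_bar) <= sum(w) + 1e-12

# Expected utility before and after averaging, consumer by consumer.
eu_before = [sum(p * u(xs) for p, xs in zip(pi, xi)) for xi in x]
eu_after = [u(xb) for xb in x_bar]

print(x_bar, feasible)
print(eu_before, eu_after)
```

Both consumers are strictly better off after averaging, so the state-dependent allocation could not have been Pareto optimal.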

From the state independence of Pareto optimal allocations and the first welfare
theorem we reach the important conclusion that if a system of complete markets over the
states S can be organized, then the equilibria are sunspot free, that is, consumption is
uniform across states. In effect, traders wish to insure completely and have instruments
to do so.

It turns out, however, that if there is not a complete set of insurance opportunities,
then the above conclusion does not hold true. Sunspot-free, Pareto optimal equilibria
always exist (just make the market "not pay attention" to the sunspot). But it is now
possible for the consumption allocation of some Radner equilibria to depend on the state,
and consequently to fail the Pareto optimality test. In such an equilibrium consumers
expect different prices in different states, and their expectations end up being self-fulfilling.
The simplest, and most trivial, example is when there are no assets whatsoever (K = 0).
Then a system of spot prices $(p_1, \ldots, p_S) \in \mathbb{R}^{LS}$ is a Radner equilibrium if and only if every
$p_s$ is a Walrasian equilibrium price vector for the spot economy defined by $\{(u_i(\cdot), w_i)\}_{i=1}^{n}$.
If, as is perfectly possible, this economy admits several distinct Walrasian equilibria, then
by selecting different equilibrium price vectors for different states, we obtain a sunspot
equilibrium, and hence a Pareto inefficient Radner equilibrium.

We have seen that Radner equilibrium allocations need not be Pareto optimal, and 
so, in principle, there may exist reallocations of consumption that make all consumers at 
least as well off, and at least one consumer strictly better off. It is important to recognize, 
however, that this need not imply that a welfare authority who is "as constrained in 
interstate transfers as the market is" can achieve a Pareto optimum. An allocation that 
cannot be Pareto improved by such an authority is called a constrained Pareto optimum. 
A more significant and reasonable welfare question to ask is, therefore, whether Radner 
equilibrium allocations are constrained Pareto optimal. We now address this matter. This 
is a typical instance of a second-best welfare issue. 

To proceed with the analysis we need a precise description of the constrained feasible
set and of the corresponding notion of constrained Pareto optimality. This is most simply
done in the context where there is a single commodity per state, that is, L = 1. The
important implication of this assumption is that then the amount of consumption good
that any consumer i gets in the different states is entirely determined by the portfolio $z_i$.
Indeed, $x_{si} = \sum_k z_{ki} r_{sk} + w_{si}$. Hence, we can let

$$U_i^*(z_i) = U_i^*(z_{1i}, \ldots, z_{Ki}) = U_i\Big(\sum_k z_{ki} r_{1k} + w_{1i}, \ldots, \sum_k z_{ki} r_{Sk} + w_{Si}\Big)$$

denote the utility induced by the portfolio $z_i$. The definition of constrained Pareto
optimality is then quite natural.

Definition 10.5.1 The asset allocation $(z_1, \ldots, z_n) \in \mathbb{R}^{Kn}$ is constrained Pareto
optimal if it is feasible (i.e., $\sum_i z_i \le 0$) and if there is no other feasible asset allocation
$(z_1', \ldots, z_n') \in \mathbb{R}^{Kn}$ such that

$U_i^*(z_1', \ldots, z_n') \ge U_i^*(z_1, \ldots, z_n)$ for every i,

with at least one inequality strict.
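
The map from a portfolio to state-by-state consumption in the L = 1 case can be sketched directly from the formula $x_{si} = \sum_k z_{ki} r_{sk} + w_{si}$, reading $r_{sk}$ as the return of asset k in state s. The returns, endowments and portfolio below are hypothetical, and the function name `consumption` is illustrative.

```python
def consumption(z_i, r, w_i):
    """State-by-state consumption x_si = sum_k z_ki * r_sk + w_si (L = 1).

    z_i: portfolio, one holding per asset k.
    r:   returns, one row per state s, one entry per asset k.
    w_i: endowment of the single good, one entry per state s.
    """
    return [sum(zk * rsk for zk, rsk in zip(z_i, r_s)) + w_s
            for r_s, w_s in zip(r, w_i)]


# Hypothetical structure with S = 3 states and K = 1 asset (so K < S).
r = [[1.0], [0.0], [2.0]]
w_i = [1.0, 2.0, 1.0]

x_i = consumption([0.5], r, w_i)
x_zero = consumption([0.0], r, w_i)
print(x_i, x_zero)
```

With a single asset the consumer can only move along one direction in the three-dimensional space of state-contingent consumption, which is why the feasible set of a planner restricted to asset reallocations is smaller than the full set of feasible allocations.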

In this L = 1 context the utility maximization problem of consumer i becomes

$$\max_{z_i \in \mathbb{R}^K} U_i^*(z_{1i}, \ldots, z_{Ki}) \quad \text{s.t. } q \cdot z_i \le 0.$$

Suppose that $z_i^* \in \mathbb{R}^K$ for i = 1, ..., n, is a family of solutions to these individual
problems, for the asset price vector $q \in \mathbb{R}^K$. Then $q \in \mathbb{R}^K$ is a Radner equilibrium price
if and only if $\sum_i z_i^* \le 0$.² This is now a perfectly conventional equilibrium problem with K
commodities, to which we can apply the first welfare theorem and reach the
conclusion of the following proposition.

Proposition 10.5.1 Suppose that there are two periods and only one consumption good in
the second period. Then any Radner equilibrium is constrained Pareto optimal in the
sense that there is no possible redistribution of assets in the first period that leaves every
consumer at least as well off and at least one consumer strictly better off.

The situation considered in Proposition 10.5.1 is very particular in that once the initial
asset portfolio of a consumer is determined, his overall consumption is fully determined:
with only one consumption good, there are no possibilities for trade once the state occurs.
In particular, second-period relative prices do not matter, simply because there are no
such prices. Things change if there is more than one consumption good in the second
period, or if there are more than two periods; such an example can be found in
Mas-Colell, Whinston, and Green (1995, p. 711).

² Recall that, given $z_i$, the consumptions in every state are determined. Also, the price of the
consumption good in every state is formally fixed to be 1.

Part IV

Externalities and Public Goods

In Chapters 7 and 8, we have introduced the notions of competitive equilibrium and
Pareto optimality, respectively. The concept of competitive equilibrium provides us with
an appropriate notion of market equilibrium for competitive market economies. The
concept of Pareto optimality offers a minimal and uncontroversial test that any socially
optimal economic outcome should pass since it is a formulation of the idea that there is
no further improvement in society, and it conveniently separates the issue of economic
efficiency from more controversial (and political) questions regarding the ideal distribution
of well-being across individuals.

The important results and insights obtained in Chapters 7 and 8 are the First and 
Second Fundamental Theorems of Welfare Economics. The first welfare theorem provides 
a set of conditions under which a market economy will achieve a Pareto optimal allocation; it is, in
a sense, the formal expression of Adam Smith's claim about the "invisible hand" of the 
market. The second welfare theorem goes even further. It states that under the same 
set of conditions as the first welfare theorem plus convexity and continuity conditions, all 
Pareto optimal outcomes can in principle be implemented through the market mechanism 
by appropriately redistributing wealth and then "letting the market work." 

Thus, in an important sense, the general equilibrium theory establishes the perfectly 
competitive case as a benchmark for thinking about outcomes in market economies. In 
particular, any inefficiency that arises in a market economy, and hence any role for
Pareto-improving market intervention, must be traceable to a violation of at least one of the
assumptions of the first welfare theorem. The remainder of the notes can be viewed as a
development of this theme, and will study a number of ways in which actual markets may
depart from this perfectly competitive ideal and where, as a result, market equilibria fail
to be Pareto optimal, a situation known as market failure.

In the current part, we will study externalities and public goods in Chapter 11 and
Chapter 12, respectively. In both cases, the actions of one agent directly affect the utility
or production of other agents in the economy. We will see that these nonmarketed "goods"
or "bads" lead to a non-Pareto optimal outcome in general, and thus to a market failure. It
turns out that private markets are often not a very good mechanism in the presence of
externalities and public goods. In Part V we will consider situations of incomplete
information, which also result in non-Pareto optimal outcomes in general.

It is gratefully acknowledged that the materials are largely drawn from the references at
the end of each chapter, especially from Varian (1992).

Chapter 11
Externalities



11.1 Introduction 

In this chapter we deal with the case of externalities, in which a market equilibrium may
lead to non-Pareto efficient allocations in general, and thus there is a market failure. The
reason is that some things that people care about are not priced. Externalities can
arise in both consumption and production.

Consumption Externality:

$u_i(x_i)$: without preference externality
$u_i(x_1, \ldots, x_n)$: with preference externality

in which other individuals' consumption affects an individual's utility.

Example 11.1.1 (i) One person's quiet environment is disturbed by another
person's loud stereo.

(ii) Mr. A hates Mr. D smoking next to him.

(iii) Mr. A's satisfaction decreases as Mr. C's consumption level increases,
because Mr. A envies Mr. C's lifestyle.

Production Externality: 

A firm's production includes arguments other than its own inputs. 




For example, downstream fishing is adversely affected by pollutants emitted from an 
upstream chemical plant. 

This leads to an examination of various suggestions for alternative ways to allocate 
resources that may lead to efficient outcomes. Achieving an efficient allocation in the 
presence of externalities essentially involves making sure that agents face the correct 
prices for their actions. Ways of solving externality problem include taxation, regulation, 
property rights, merges, etc. 



11.2 Consumption Externalities 

When there are no consumption externalities, agent i's utility function is a function of
only his own consumption:

$$u_i(x_i) \tag{11.1}$$

In this case, the first order conditions for the competitive equilibrium are given by

$$MRS^A_{xy} = \frac{p_x}{p_y} = MRS^B_{xy}$$

and the first order conditions for Pareto efficiency are given by:

$$MRS^A_{xy} = MRS^B_{xy}.$$

Thus, because of price-taking behavior, every competitive equilibrium implies Pareto
efficiency if utility functions are quasi-concave.

The main purpose of this section is to show that a competitive equilibrium allocation
is not in general Pareto efficient when there exists an externality in consumption. We
show this by verifying that the first order conditions for a competitive equilibrium are
not in general the same as the first order conditions for Pareto efficient allocations in the
presence of consumption externalities. The following material is mainly drawn from
Tian and Yang (2009).

Consider the following simple two-person and two-good exchange economy, with utility functions

$$u_A(x_A, x_B, y_A) \tag{11.2}$$

$$u_B(x_A, x_B, y_B) \tag{11.3}$$

which are assumed to be strictly increasing in one's own consumption, quasi-concave,
and to satisfy the Inada conditions $\frac{\partial u_i}{\partial x_i}(0) = +\infty$ and
$\lim_{x_i \to 0} \frac{\partial u_i}{\partial x_i} x_i = 0$, so that solutions are
interior. We further assume that the gradient of $u_i(\cdot)$ is nonzero at Pareto
efficient allocations. Here good x results in consumption externalities.

The first order conditions for the competitive equilibrium are the same as before:

$$MRS^A_{xy} = \frac{p_x}{p_y} = MRS^B_{xy}.$$

We now find the first order conditions for Pareto efficient allocations in exchange
economies with externalities. Pareto efficient allocations $x^*$ can be completely
determined by the FOCs of the following problem:

$$\max \; u_B(x_A, x_B, y_B)$$

$$\text{s.t.} \quad x_A + x_B \leq w_x$$

$$y_A + y_B \leq w_y$$

$$u_A(x_A, x_B, y_A) \geq u_A(x_A^*, x_B^*, y_A^*)$$

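The first order conditions, with Kuhn-Tucker multipliers $\lambda_x$, $\lambda_y$, and $\mu$ on the
three constraints, are

$$x_A: \quad \frac{\partial u_B}{\partial x_A} - \lambda_x + \mu \frac{\partial u_A}{\partial x_A} = 0 \tag{11.4}$$

$$y_A: \quad -\lambda_y + \mu \frac{\partial u_A}{\partial y_A} = 0 \tag{11.5}$$

$$x_B: \quad \frac{\partial u_B}{\partial x_B} - \lambda_x + \mu \frac{\partial u_A}{\partial x_B} = 0 \tag{11.6}$$

$$y_B: \quad \frac{\partial u_B}{\partial y_B} - \lambda_y = 0 \tag{11.7}$$

$$\lambda_x: \quad w_x - x_A - x_B \geq 0, \; \lambda_x \geq 0, \; \lambda_x (w_x - x_A - x_B) = 0 \tag{11.8}$$

$$\lambda_y: \quad w_y - y_A - y_B \geq 0, \; \lambda_y \geq 0, \; \lambda_y (w_y - y_A - y_B) = 0 \tag{11.9}$$

$$\mu: \quad u_A - u_A^* \geq 0, \; \mu \geq 0, \; \mu (u_A - u_A^*) = 0 \tag{11.10}$$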

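By (11.7), $\lambda_y = \frac{\partial u_B}{\partial y_B} > 0$, and thus by (11.9),

$$y_A + y_B = w_y \tag{11.11}$$

which means there is never destruction of the good which does not exhibit a negative
externality. Also, by (11.5) and (11.7), we have

$$\mu = \frac{\partial u_B / \partial y_B}{\partial u_A / \partial y_A}. \tag{11.12}$$

Then, by (11.4) and (11.5), we have

$$\frac{\lambda_x}{\lambda_y} = \frac{\partial u_A / \partial x_A}{\partial u_A / \partial y_A} + \frac{\partial u_B / \partial x_A}{\partial u_B / \partial y_B} \tag{11.13}$$

and by (11.6) and (11.7), we have

$$\frac{\lambda_x}{\lambda_y} = \frac{\partial u_B / \partial x_B}{\partial u_B / \partial y_B} + \frac{\partial u_A / \partial x_B}{\partial u_A / \partial y_A}. \tag{11.14}$$

Thus, by (11.13) and (11.14), we have

$$\frac{\partial u_A / \partial x_A}{\partial u_A / \partial y_A} + \frac{\partial u_B / \partial x_A}{\partial u_B / \partial y_B} = \frac{\partial u_B / \partial x_B}{\partial u_B / \partial y_B} + \frac{\partial u_A / \partial x_B}{\partial u_A / \partial y_A}, \tag{11.15}$$

which expresses the equality of the social marginal rates of substitution for the two
consumers at Pareto efficient points. Thus, we immediately have the following conclusion:

A competitive equilibrium allocation may not be Pareto optimal because the first
order conditions for competitive equilibrium and Pareto optimality are not the same.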

From the above marginal equality condition, we know that, in order to evaluate the
relevant marginal rates of substitution for the optimality conditions, we must take into
account both the direct and indirect effects of consumption activities in the presence of
externalities. That is, to achieve Pareto optimality, when one consumer increases the
consumption of good x, not only does that consumer's consumption of good y need to
change, the other consumer's consumption of good y must also change. Therefore the
social marginal rate of substitution of good x for good y by consumer i equals
$\frac{\partial u_i / \partial x_i}{\partial u_i / \partial y_i} + \frac{\partial u_j / \partial x_i}{\partial u_j / \partial y_j}$, where $j \neq i$.

Solving (11.4) and (11.6) for $\mu$ and $\lambda_x$, we have

$$\mu = \frac{\frac{\partial u_B}{\partial x_B} - \frac{\partial u_B}{\partial x_A}}{\frac{\partial u_A}{\partial x_A} - \frac{\partial u_A}{\partial x_B}} > 0 \tag{11.16}$$

and

$$\lambda_x = \frac{\frac{\partial u_A}{\partial x_A}\frac{\partial u_B}{\partial x_B} - \frac{\partial u_A}{\partial x_B}\frac{\partial u_B}{\partial x_A}}{\frac{\partial u_A}{\partial x_A} - \frac{\partial u_A}{\partial x_B}}. \tag{11.17}$$

When the consumption externality is positive, from (11.13) or (11.14), we can easily
see that $\lambda_x$ is always positive since $\lambda_y = \frac{\partial u_B}{\partial y_B} > 0$. Also, when no externality or a
one-sided externality¹ exists, by either (11.13) or (11.14), $\lambda_x$ is positive. Thus, the marginal
equality condition (11.15) and the balanced conditions completely determine all Pareto
efficient allocations for these cases. However, when there are negative externalities for
both consumers, the Kuhn-Tucker multiplier $\lambda_x$ directly given by (11.17) or indirectly
given by (11.13) or (11.14) is the sum of a negative and a positive term, and thus the sign
of $\lambda_x$ may be indeterminate. Hence, contrary to the claim in some textbooks such as Varian
(1992, 438), the marginal equality condition (11.15) and the balanced conditions may
not guarantee finding Pareto efficient allocations correctly.

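To guarantee that an allocation is Pareto efficient in the presence of negative
externalities, we must require $\lambda_x \geq 0$ at efficient points, which in turn requires that social
marginal rates of substitution be nonnegative, that is,

$$\frac{\partial u_A / \partial x_A}{\partial u_A / \partial y_A} + \frac{\partial u_B / \partial x_A}{\partial u_B / \partial y_B} = \frac{\partial u_B / \partial x_B}{\partial u_B / \partial y_B} + \frac{\partial u_A / \partial x_B}{\partial u_A / \partial y_A} \geq 0, \tag{11.18}$$

or equivalently requires both (11.15) and

$$\underbrace{\frac{\partial u_A}{\partial x_A}\frac{\partial u_B}{\partial x_B}}_{\text{joint marginal benefit}} \geq \underbrace{\frac{\partial u_A}{\partial x_B}\frac{\partial u_B}{\partial x_A}}_{\text{joint marginal cost}} \tag{11.19}$$

for all Pareto efficient points.

We can interpret the term on the left-hand side of (11.19), $\frac{\partial u_A}{\partial x_A}\frac{\partial u_B}{\partial x_B}$, as the joint marginal
benefit of consuming good x, and the term on the right-hand side, $\frac{\partial u_A}{\partial x_B}\frac{\partial u_B}{\partial x_A}$, as the joint
marginal cost of consuming good x because the negative externality hurts the consumers.
To consume the goods efficiently, a necessary condition is that the joint marginal benefit
of consuming good x should not be less than the joint marginal cost of consuming good x.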

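Thus, the following conditions

$$(\text{PO}) \quad \begin{cases} \dfrac{\partial u_A / \partial x_A}{\partial u_A / \partial y_A} + \dfrac{\partial u_B / \partial x_A}{\partial u_B / \partial y_B} = \dfrac{\partial u_B / \partial x_B}{\partial u_B / \partial y_B} + \dfrac{\partial u_A / \partial x_B}{\partial u_A / \partial y_A} \geq 0 \\ y_A + y_B = w_y \\ x_A + x_B \leq w_x \\ \left( \dfrac{\partial u_A}{\partial x_A}\dfrac{\partial u_B}{\partial x_B} - \dfrac{\partial u_A}{\partial x_B}\dfrac{\partial u_B}{\partial x_A} \right) (w_x - x_A - x_B) = 0 \end{cases}$$

constitute a system (PO) from which all Pareto efficient allocations can be obtained. We
can do so by considering three cases.

¹ Only one consumer imposes an externality on the other consumer.

Case 1. When $\frac{\partial u_A}{\partial x_A}\frac{\partial u_B}{\partial x_B} > \frac{\partial u_A}{\partial x_B}\frac{\partial u_B}{\partial x_A}$, or equivalently

$$\frac{\partial u_A / \partial x_A}{\partial u_A / \partial y_A} + \frac{\partial u_B / \partial x_A}{\partial u_B / \partial y_B} = \frac{\partial u_B / \partial x_B}{\partial u_B / \partial y_B} + \frac{\partial u_A / \partial x_B}{\partial u_A / \partial y_A} > 0,$$

we have $\lambda_x > 0$, and thus the last two conditions in the above system (PO) reduce to $x_A + x_B = w_x$. In
this case, there is no destruction. Substituting $x_A + x_B = w_x$ and $y_A + y_B = w_y$ into the
marginal equality condition (11.15) gives a relationship between $x_A$ and $y_A$,
which exactly defines the Pareto efficient allocations.

Case 2. When the joint marginal benefit equals the joint marginal cost:

$$\frac{\partial u_A}{\partial x_A}\frac{\partial u_B}{\partial x_B} = \frac{\partial u_A}{\partial x_B}\frac{\partial u_B}{\partial x_A}, \tag{11.20}$$

then

$$\frac{\partial u_A / \partial x_A}{\partial u_A / \partial y_A} + \frac{\partial u_B / \partial x_A}{\partial u_B / \partial y_B} = \frac{\partial u_B / \partial x_B}{\partial u_B / \partial y_B} + \frac{\partial u_A / \partial x_B}{\partial u_A / \partial y_A} = 0, \tag{11.21}$$

and thus $\lambda_x = 0$. In this case, when $x_A + x_B \leq w_x$, the necessity of destruction is
indeterminate. However, even when destruction is necessary, we can still determine the
set of Pareto efficient allocations by using $y_A + y_B = w_y$ and the zero social marginal
equality conditions (11.21). Indeed, after substituting $y_A + y_B = w_y$ into (11.21), we can
solve for $x_A$ in terms of $y_A$.

Case 3. When $\frac{\partial u_A}{\partial x_A}\frac{\partial u_B}{\partial x_B} < \frac{\partial u_A}{\partial x_B}\frac{\partial u_B}{\partial x_A}$ for any allocations that satisfy $x_A + x_B = w_x$,
$y_A + y_B = w_y$, and the marginal equality condition (11.15), the social marginal rates of
substitution must be negative. Hence, such an allocation will not be Pareto efficient. In this
case, there must be destruction of good x for Pareto efficiency, and a Pareto efficient
allocation satisfies (11.21).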

Summarizing, we have the following proposition that provides two categories of
sufficient conditions for characterizing whether or not there should be destruction of the
endowment $w_x$ in achieving Pareto efficient allocations.

Proposition 11.2.1 For 2x2 pure exchange economies, suppose that utility functions
$u_i(x_A, x_B, y_i)$ are continuously differentiable, strictly quasi-concave, and
$\frac{\partial u_i(x_A, x_B, y_i)}{\partial x_i} > 0$ for $i = A, B$.

(1) If the social marginal rates of substitution are positive at a Pareto efficient
allocation $x^*$,² then there is no destruction of $w_x$ in achieving the Pareto
efficient allocation $x^*$.

(2) If the social marginal rates of substitution are negative for any allocation
$(x_A, x_B)$ satisfying $x_A + x_B = w_x$, $y_A + y_B = w_y$, and the marginal equality
condition (11.15), then there is destruction of $w_x$ in achieving any Pareto
efficient allocation $x^*$. That is, $x_A^* + x_B^* < w_x$ and $x^*$ is determined by
$y_A + y_B = w_y$ and (11.21).



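Thus, from the above proposition, we know that a sufficient condition for destruction
is that for any allocation $(x_A, y_A, x_B, y_B)$,

$$\left. \begin{array}{l} \dfrac{\partial u_A / \partial x_A}{\partial u_A / \partial y_A} + \dfrac{\partial u_B / \partial x_A}{\partial u_B / \partial y_B} = \dfrac{\partial u_B / \partial x_B}{\partial u_B / \partial y_B} + \dfrac{\partial u_A / \partial x_B}{\partial u_A / \partial y_A} \\ x_A + x_B = w_x \\ y_A + y_B = w_y \end{array} \right\} \Rightarrow \frac{\partial u_A}{\partial x_A}\frac{\partial u_B}{\partial x_B} < \frac{\partial u_A}{\partial x_B}\frac{\partial u_B}{\partial x_A}. \tag{11.22}$$

A sufficient condition for no destruction at all Pareto efficient allocations $(x_A, y_A, x_B, y_B)$ is

$$\left. \begin{array}{l} \dfrac{\partial u_A / \partial x_A}{\partial u_A / \partial y_A} + \dfrac{\partial u_B / \partial x_A}{\partial u_B / \partial y_B} = \dfrac{\partial u_B / \partial x_B}{\partial u_B / \partial y_B} + \dfrac{\partial u_A / \partial x_B}{\partial u_A / \partial y_A} \\ x_A + x_B \leq w_x \\ y_A + y_B = w_y \end{array} \right\} \Rightarrow \frac{\partial u_A}{\partial x_A}\frac{\partial u_B}{\partial x_B} > \frac{\partial u_A}{\partial x_B}\frac{\partial u_B}{\partial x_A}. \tag{11.23}$$

Example 11.2.1 Consider the following utility function:

$$u_i(x_A, x_B, y_i) = \sqrt{x_i y_i} - x_j, \quad i \in \{A, B\}, \; j \in \{A, B\}, \; j \neq i.$$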
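By the marginal equality condition (11.15), we have

$$\left( \sqrt{\frac{y_A}{x_A}} + 1 \right)^2 = \left( \sqrt{\frac{y_B}{x_B}} + 1 \right)^2 \tag{11.24}$$

and thus

$$\frac{y_A}{x_A} = \frac{y_B}{x_B}. \tag{11.25}$$

Let $x_A + x_B \equiv \bar{x}$. Substituting $x_A + x_B = \bar{x}$ and $y_A + y_B = w_y$ into (11.25), we have

$$\frac{y_A}{x_A} = \frac{w_y}{\bar{x}}. \tag{11.26}$$

² As we discussed above, this is true if the consumption externality is positive, or there is no externality
or only a one-sided externality.

Then, by (11.25) and (11.26), we have

$$\frac{\partial u_A}{\partial x_A}\frac{\partial u_B}{\partial x_B} = \frac{1}{4}\sqrt{\frac{y_A}{x_A}}\sqrt{\frac{y_B}{x_B}} = \frac{y_A}{4 x_A} = \frac{w_y}{4 \bar{x}} \tag{11.27}$$

and

$$\frac{\partial u_A}{\partial x_B}\frac{\partial u_B}{\partial x_A} = 1. \tag{11.28}$$

Thus, $\bar{x} = w_y/4$ is the critical point that makes
$\frac{\partial u_A}{\partial x_A}\frac{\partial u_B}{\partial x_B} - \frac{\partial u_A}{\partial x_B}\frac{\partial u_B}{\partial x_A} = 0$, or equivalently

$$\frac{\partial u_A / \partial x_A}{\partial u_A / \partial y_A} + \frac{\partial u_B / \partial x_A}{\partial u_B / \partial y_B} = \frac{\partial u_B / \partial x_B}{\partial u_B / \partial y_B} + \frac{\partial u_A / \partial x_B}{\partial u_A / \partial y_A} = 0.$$

Hence, if $w_x > w_y/4$, then $\frac{\partial u_A}{\partial x_A}\frac{\partial u_B}{\partial x_B} - \frac{\partial u_A}{\partial x_B}\frac{\partial u_B}{\partial x_A} < 0$ whenever $x_A + x_B = w_x$, and
thus, by (11.22), there is destruction in any Pareto efficient allocation. If $w_x < w_y/4$, then
$\frac{\partial u_A}{\partial x_A}\frac{\partial u_B}{\partial x_B} - \frac{\partial u_A}{\partial x_B}\frac{\partial u_B}{\partial x_A} > 0$, and, by (11.23), no Pareto optimal allocation requires destruction.
Finally, when $w_x = w_y/4$, any allocation that satisfies the marginal equality condition
(11.15) and the balanced conditions $x_A + x_B = w_x$ and $y_A + y_B = w_y$ also satisfies (11.19),
and thus it is a Pareto efficient allocation with no destruction.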
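The threshold in Example 11.2.1 can be checked numerically. The following is a minimal sketch, assuming the utility function $u_i = \sqrt{x_i y_i} - x_j$ of the example; the function names and the sample endowments are hypothetical and chosen only for illustration. It evaluates the joint marginal benefit minus the joint marginal cost, the social marginal rate of substitution, and the total amount of good x consumed at an efficient allocation (the endowment when $w_x \leq w_y/4$, and $w_y/4$ otherwise, by (11.21) and (11.25)).

```python
import math


def joint_benefit_minus_cost(x_a, y_a, x_b, y_b):
    """(du_A/dx_A)(du_B/dx_B) - (du_A/dx_B)(du_B/dx_A) for u_i = sqrt(x_i*y_i) - x_j."""
    own_a = 0.5 * math.sqrt(y_a / x_a)  # du_A/dx_A
    own_b = 0.5 * math.sqrt(y_b / x_b)  # du_B/dx_B
    cross = (-1.0) * (-1.0)             # (du_A/dx_B)(du_B/dx_A) = 1
    return own_a * own_b - cross


def social_mrs(x_own, y_own, x_other, y_other):
    """Social MRS of the consumer holding (x_own, y_own): own MRS plus externality term."""
    return y_own / x_own - 2.0 * math.sqrt(y_other / x_other)


def efficient_total_x(w_x, w_y):
    """Total consumption of good x at a Pareto efficient allocation in this example."""
    return min(w_x, w_y / 4.0)


# Hypothetical endowment of good y; the critical level of x is then w_y / 4 = 2.
w_y = 8.0
for w_x in (1.0, 2.0, 3.0):
    destroyed = w_x - efficient_total_x(w_x, w_y)
    print(f"w_x = {w_x}: destroy {destroyed} units of good x")
```

With these numbers, destruction occurs only for the endowment above $w_y/4$, which is the pattern stated in the example.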

Now, let us set out the sufficient conditions for destruction and non-destruction in
detail. Note that if the sufficient condition for non-destruction holds, then $x_A + x_B = w_x$
would also hold as an implication. Thus, to apply both the sufficient conditions for
non-destruction and destruction, one should use the following three conditions:

$$\frac{\partial u_A / \partial x_A}{\partial u_A / \partial y_A} + \frac{\partial u_B / \partial x_A}{\partial u_B / \partial y_B} = \frac{\partial u_B / \partial x_B}{\partial u_B / \partial y_B} + \frac{\partial u_A / \partial x_B}{\partial u_A / \partial y_A}$$

$$x_A + x_B = w_x$$

$$y_A + y_B = w_y$$

If an allocation also satisfies the condition $\frac{\partial u_A}{\partial x_A}\frac{\partial u_B}{\partial x_B} \geq \frac{\partial u_A}{\partial x_B}\frac{\partial u_B}{\partial x_A}$, then the allocation is
Pareto efficient, and further there is no destruction. Otherwise, it is not Pareto efficient.
If this is the case for all allocations satisfying the three conditions, every Pareto efficient
allocation must involve destruction.

Note that, since $\frac{\partial u_A}{\partial x_A}$ and $\frac{\partial u_B}{\partial x_B}$ represent marginal benefits, they are usually diminishing
in the consumption of good x. Since $\frac{\partial u_A}{\partial x_B}$ and $\frac{\partial u_B}{\partial x_A}$ are in the form of a marginal cost, their
absolute values would typically be increasing in the consumption of good x. Hence, when the
total endowment $w_x$ is small, the social marginal benefit would exceed the social marginal
cost so that there is no destruction. As the total endowment $w_x$ increases, the social
marginal cost will ultimately outweigh the social marginal benefit, which results in the
destruction of part of the endowment $w_x$.

Alternatively, we can get the same result by using social marginal rates of substitution.
When utility functions are strictly quasi-concave, marginal rates of substitution
are diminishing. Therefore, in the presence of negative consumption externalities, social
marginal rates of substitution may become negative when the consumption of good x
becomes sufficiently large. When this occurs, it is better to destroy some resources for good
x. As the destruction of good x increases, which will, in turn, decrease the consumption
of good x, social marginal rates of substitution will increase. Eventually they will become
nonnegative.

The destruction issue is not only important in theory, but also relevant to reality. It 
can be used to explain a well-known puzzle of the happiness-income relationship in the 
economic and psychology literatures: happiness rises with income up to a point, but not 
beyond it. For example, well-being has declined over the last quarter of a century in the 
U.S., and life satisfaction has run approximately flat across the same time in Britain. If 
we interpret income as a good, and if consumers envy each other in terms of consumption 
levels, by our result, when the income reaches a certain level, one may have to freely 
dispose of wealth to achieve Pareto efficient allocations; otherwise the resulting allocations 
will be Pareto inefficient. Therefore, economic growth does not raise well-being indexed 
by any social welfare function once the critical income level is achieved. For detailed 
discussion, see Tian and Yang (2007, 2009). 

11.3 Production Externality 

We now show that the allocation of resources may also fail to be efficient in the case of an
externality in production. To show this, consider a simple economy with two firms. Firm 1
produces an output x which will be sold in a competitive market. However, production
of x imposes an externality cost denoted by e(x) on firm 2, which is assumed to be convex
and strictly increasing.

Let y be the output produced by firm 2, which is sold in a competitive market.

Let $c_x(x)$ and $c_y(y)$ be the cost functions of firms 1 and 2, which are both convex and
strictly increasing.

The profits of the two firms are:

$$\pi_1 = p_x x - c_x(x) \tag{11.29}$$

$$\pi_2 = p_y y - c_y(y) - e(x) \tag{11.30}$$

where $p_x$ and $p_y$ are the prices of x and y, respectively. Then, by the first order conditions,
we have for positive amounts of outputs:

$$p_x = c_x'(x) \tag{11.31}$$

$$p_y = c_y'(y) \tag{11.32}$$

However, the profit maximizing output $x_c$ from the first order condition is too large from
a social point of view. The first firm only takes account of the private cost (the cost
that is imposed on itself), but it ignores the social cost (the private cost plus the cost it
imposes on the other firm).

What is the socially efficient output?

The social profit, $\pi_1 + \pi_2$, is not maximized at $x_c$ and $y_c$, which satisfy (11.31) and
(11.32). If the two firms merged so as to internalize the externality, the merged firm would solve

$$\max_{x, y} \; p_x x + p_y y - c_x(x) - e(x) - c_y(y) \tag{11.33}$$

which gives the first order conditions:

$$p_x = c_x'(x^*) + e'(x^*)$$

$$p_y = c_y'(y^*)$$

where $x^*$ is an efficient amount of output; it is characterized by price being equal to the
marginal social cost. Since $e'(x^*) > 0$, we have $c_x'(x^*) < p_x = c_x'(x_c)$, and thus production of
$x^*$ is less than the competitive output $x_c$ in the externality case by the convexity of e(x) and $c_x(x)$.
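To make the comparison between $x_c$ and $x^*$ concrete, here is a minimal numerical sketch. The price and the quadratic cost functions $c_x(x) = x^2/2$ and $e(x) = x^2/4$ are hypothetical choices that satisfy the convexity and monotonicity assumptions for $x > 0$; they are not taken from the text. The two first order conditions are solved by bisection.

```python
def bisect_root(f, lo, hi, tol=1e-10):
    """Root of an increasing function f on [lo, hi], found by bisection."""
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if f(mid) > 0:
            hi = mid
        else:
            lo = mid
    return 0.5 * (lo + hi)


P_X = 10.0  # hypothetical competitive price of good x


def marginal_private_cost(x):
    """c_x'(x) for the assumed cost function c_x(x) = x**2 / 2."""
    return x


def marginal_external_cost(x):
    """e'(x) for the assumed externality cost e(x) = x**2 / 4."""
    return 0.5 * x


# Competitive output: price equals private marginal cost, as in (11.31).
x_c = bisect_root(lambda x: marginal_private_cost(x) - P_X, 0.0, 100.0)

# Efficient output: price equals social marginal cost, from the merged firm's problem (11.33).
x_star = bisect_root(
    lambda x: marginal_private_cost(x) + marginal_external_cost(x) - P_X, 0.0, 100.0
)

print(f"competitive output x_c = {x_c:.4f}")
print(f"efficient output x*    = {x_star:.4f}")
```

Under these assumed functions the efficient output is strictly below the competitive one, as the argument above requires.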
Figure 11.1: The efficient output $x^*$ is less than the competitive output $x_c$.



11.4 Solutions to Externalities 

From the above discussion, we know that a competitive market in general may not result
in a Pareto efficient outcome in the presence of externalities, and one needs to seek
alternative mechanisms to solve the market failure problem. In this section, we
introduce some remedies for this market failure, such as:

1. Pigovian taxes

2. Voluntary negotiation (Coase approach)

3. Compensatory tax/subsidy

4. Creating missing markets with property rights

5. Direct intervention

6. Mergers

7. Incentive mechanism design

Any of the above solutions may result in Pareto efficient outcomes, but they may lead to different
income distributions. Also, it is important to know what kind of information is required
to implement a solution listed above.

Most of the above proposed solutions need to make the following assumptions:

1. The source and degree of the externality are identifiable.

2. The recipients of the externality are identifiable.

3. The causal relationship of the externality can be established objectively.

4. The costs of preventing an externality (by different methods) are perfectly
known to everyone.

5. The cost of implementing taxes and subsidies is negligible.

6. The cost of voluntary negotiation is negligible.

11.4.1 Pigovian Tax 

Set a tax rate, t, such that $t = e'(x^*)$. Imposing this tax rate on firm 1 would internalize the
externality:

$$\pi_1 = p_x \cdot x - c_x(x) - t \cdot x \tag{11.34}$$

The first order condition is:

$$p_x = c_x'(x) + t = c_x'(x) + e'(x^*), \tag{11.35}$$

which is the same as the one for social optimality. That is, firm 1 faces the wrong
price for its action, and a tax $t = e'(x^*)$ should be imposed on each unit of firm 1's
production. This leads to the socially optimal output, which is less than the competitive
equilibrium output. Such corrective taxes are called Pigovian taxes.

The problem with this solution is that it requires the taxing authority to know the
externality cost e(x). But how does the authority know the externality, and how does it
estimate the value of the externality in the real world? If the authority knows this information, it
might as well just tell the firm how much to produce in the first place. So, in most cases,
it will not work well.
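The way the tax aligns the private and social first order conditions can be illustrated with a short sketch. It assumes the hypothetical price and quadratic costs $c_x(x) = x^2/2$ and $e(x) = x^2/4$ (illustrative choices, not from the text) and maximizes each concave objective with a simple ternary search, so that no closed-form solution is presupposed.

```python
def argmax_concave(f, lo, hi, tol=1e-9):
    """Maximiser of a strictly concave function f on [lo, hi], by ternary search."""
    while hi - lo > tol:
        m1 = lo + (hi - lo) / 3.0
        m2 = hi - (hi - lo) / 3.0
        if f(m1) < f(m2):
            lo = m1
        else:
            hi = m2
    return 0.5 * (lo + hi)


P_X = 10.0  # hypothetical competitive price of good x


def cost_x(x):
    """Assumed private cost c_x(x) = x**2 / 2."""
    return 0.5 * x * x


def externality_cost(x):
    """Assumed external cost e(x) = x**2 / 4 imposed on firm 2."""
    return 0.25 * x * x


def externality_marginal(x):
    """e'(x) for the assumed externality cost."""
    return 0.5 * x


# Socially efficient output: maximise the x-part of joint profit in (11.33).
x_star = argmax_concave(lambda x: P_X * x - cost_x(x) - externality_cost(x), 0.0, 50.0)

# Pigovian tax rate t = e'(x*), then firm 1's choice under profit (11.34).
tax = externality_marginal(x_star)
x_taxed = argmax_concave(lambda x: P_X * x - cost_x(x) - tax * x, 0.0, 50.0)

# Firm 1's choice without the tax, as in (11.31).
x_untaxed = argmax_concave(lambda x: P_X * x - cost_x(x), 0.0, 50.0)

print(f"x* = {x_star:.4f}, t = {tax:.4f}, taxed output = {x_taxed:.4f}, untaxed output = {x_untaxed:.4f}")
```

Note that computing the tax here requires `externality_marginal`, which is exactly the information the text says the authority may not have.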
11.4.2 Coase's Voluntary Negotiation and Enforceable Property
Rights Approach

A different approach to the externality problem relies on the parties to negotiate a solution
to the problem themselves.

The greatest novelty of Nobel laureate Ronald Coase's contribution was the systematic
treatment of trade in property rights. To solve the externality problem, Coase, in a
famous 1960 article, "The Problem of Social Cost", argues that the success of such a
system depends on making sure that property rights are clearly assigned. The so-called
Coase Theorem asserts that as long as property rights are clearly assigned, the two
parties will negotiate in such a way that the optimal level of the externality-producing
activity is implemented. As a policy implication, a government should simply assign
property rights appropriately. The market could then take care
of externalities without direct government intervention.

The Coase Theorem thus contains two claims. One is that the level of the externality will be
the same regardless of the assignment of property rights, which is called the Coase Neutrality
Theorem. The second is that voluntary negotiations over externalities will lead to a
Pareto-optimal outcome, which is called the Coase Efficiency Theorem.

Coase shows his propositions mainly by using examples of economies with only two
agents and a detrimental externality. Coase made the observation that in the presence of
externalities, the victim has an incentive to pay the firm to stop production if the
victim can compensate the firm by paying $p_x - c_x'(x^*)$. The following example captures
Coase's arguments.

Example 11.4.1 Two firms: one is a chemical factory that discharges chemicals into a
river and the other is a fisherman. Suppose the river can produce a value of $50,000.
If the chemicals pollute the river, the fish cannot be eaten. How does one solve the
externality? Coase's method states that as long as the property rights of the river are
clearly assigned, it results in efficient outcomes. That is, the government should give the
ownership of the river either to the chemical firm or to the fisherman; either assignment
will yield an efficient outcome. To see this, assume that:

The cost of the filter is denoted by $c_f$.

Case 1: The river is given to the factory.

i) $c_f$ < $50,000. The fisherman is willing to buy a filter for the factory. The
fisherman will pay for the filter so that the chemicals cannot pollute the
river.

ii) $c_f$ > $50,000. The chemicals are discharged into the river. The fisherman
does not want to install any filter.

Case 2: The river is given to the fisherman, and the firm's net product revenue is greater
than $50,000.

i) $c_f$ < $50,000. The factory buys the filter so that the chemicals cannot
pollute the river.

ii) $c_f$ > $50,000. The firm pays $50,000 to the fisherman, and then the chemicals
are discharged into the river.
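The four cases of Example 11.4.1 can be summarized in a few lines of code. This is only an illustrative sketch: the function name `coase_outcome` and the sample filter costs are hypothetical, and it assumes, as in Case 2, that the factory's net revenue exceeds $50,000 and that bargaining is costless. The tie $c_f$ = $50,000 is not covered by the example; the sketch treats it as no filter.

```python
FISH_VALUE = 50_000  # value the water body produces for the fisherman, from the example


def coase_outcome(filter_cost, owner):
    """Return (filter_installed, payer, payment) under costless bargaining.

    owner is "factory" or "fisherman". The factory's net revenue is assumed
    to exceed FISH_VALUE, as in Case 2 of the example.
    """
    install = filter_cost < FISH_VALUE  # installing is efficient iff it costs less than the fish
    if owner == "factory":
        # The fisherman buys the filter only when it is worth it to him.
        return (install, "fisherman" if install else None, filter_cost if install else 0)
    if owner == "fisherman":
        # The factory either buys the filter or compensates the fisherman.
        return (install, "factory", filter_cost if install else FISH_VALUE)
    raise ValueError("owner must be 'factory' or 'fisherman'")


for cost in (30_000, 70_000):  # hypothetical filter costs
    for owner in ("factory", "fisherman"):
        print(cost, owner, coase_outcome(cost, owner))
```

In both assignments the filter is installed exactly when $c_f$ < $50,000; only the identity of the payer and the size of the payment change.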

Like the above example, Coase's own examples supporting his claims deal with negotiations
between firms or businesses rather than those between individuals. This difference is
important since firms maximize profits rather than utility, and act as fiduciaries for
individuals. We have to make some restrictive assumptions on consumers' utility functions
for Coase's Theorem to hold.

Now consider an economy with two consumers and L goods. Further, consumer i has
initial wealth $w_i$. Each consumer has preferences over both the commodities he consumes
and over some action h that is taken by consumer 1. That is,

$$u_i(x_i^1, \ldots, x_i^L, h).$$

Activity h is something that has no direct monetary cost for person 1. For example, h
is the quantity of loud music played by person 1. In order to play it, the consumer must
purchase electricity, but electricity can be captured as one of the components of $x_i$. From
the point of view of consumer 2, h represents an external effect of consumer 1's action.
In the model, we assume that

$$\frac{\partial u_2}{\partial h} \neq 0.$$

Thus the externality in this model lies in the fact that h affects consumer 2's utility,
but it is not priced by the market. Let $v_i(p, w_i, h)$ be consumer i's indirect utility function:

$$v_i(w_i, h) = \max_{x_i} u_i(x_i, h) \quad \text{s.t.} \quad p x_i \leq w_i.$$

We assume that preferences are quasi-linear with respect to some numeraire commodity.
Thus, the consumer's indirect utility function takes the form:

$$v_i(w_i, h) = \phi_i(h) + w_i.$$

We further assume that utility is strictly concave in h: $\phi_i''(h) < 0$. Again, the
competitive equilibrium outcome in general is not Pareto optimal. In order to maximize
utility, consumer 1 chooses h to maximize $v_1$, so that the interior solution
satisfies $\phi_1'(h^*) = 0$. Even though consumer 2's utility depends on h, consumer 2 cannot affect the
choice of h.

On the other hand, the socially optimal level of h will maximize the sum of the
consumers' utilities:

$$\max_h \; \phi_1(h) + \phi_2(h).$$

The first-order condition for an interior maximum is:

$$\phi_1'(h^{**}) + \phi_2'(h^{**}) = 0,$$

where $h^{**}$ is the Pareto optimal amount of h. Thus, the social optimum is where the
sum of the marginal benefits of the two consumers equals zero. In the case where the
externality is bad for consumer 2 (loud music), we have $h^* > h^{**}$. That is, too much
h is produced. In the case where the externality is good for consumer 2 (the smell of baking
bread or yard beautification), too little will be provided: $h^* < h^{**}$.

Now we show that, as long as property rights are clearly assigned, the two parties
will negotiate in such a way that the optimal level of the externality-producing activity
is implemented. We first consider the case where consumer 2 has the right to prohibit
consumer 1 from undertaking activity h. But this right is contractible. Consumer 2 can
sell consumer 1 the right to undertake $h_2$ units of activity h in exchange for some transfer,
$T_2$. The two consumers will bargain both over the size of the transfer $T_2$ and over the
number of units of the externality good produced, $h_2$.

In order to determine the outcome of the bargaining, we first specify the bargaining
mechanism as follows:

1. Consumer 2 offers consumer 1 a take-it-or-leave-it contract specifying a
payment $T_2$ and an activity level $h_2$.

2. If consumer 1 accepts the offer, that outcome is implemented. If consumer 1
does not accept the offer, consumer 1 cannot produce any of the externality
good, i.e., h = 0.

To analyze this, begin by considering which offers (h, T) will be accepted by consumer
1. Since in the absence of agreement consumer 1 must produce h = 0, consumer 1 will
accept $(h_2, T_2)$ if and only if it offers at least as much utility as h = 0. That is, consumer 1
accepts if and only if:

$$\phi_1(h_2) - T_2 \geq \phi_1(0).$$

Given this constraint on the set of acceptable offers, consumer 2 will choose $(h_2, T_2)$
in order to solve the following problem:

$$\max_{h_2, T_2} \; \phi_2(h_2) + T_2$$

$$\text{s.t.} \quad \phi_1(h_2) - T_2 \geq \phi_1(0).$$

Since consumer 2 prefers higher $T_2$, the constraint will bind at the optimum. Thus
the problem becomes:

$$\max_{h_2} \; \phi_1(h_2) + \phi_2(h_2) - \phi_1(0).$$

The first-order condition for this problem is given by:

$$\phi_1'(h_2) + \phi_2'(h_2) = 0.$$

But this is the same condition that defines the socially optimal level of h. Thus
consumer 2 chooses $h_2 = h^{**}$, and, using the constraint, $T_2 = \phi_1(h^{**}) - \phi_1(0)$. And the
offer $(h_2, T_2)$ is accepted by consumer 1. Thus this bargaining process implements the
social optimum.

Now we consider the case where consumer 1 has the right to produce as much of the
externality as she wants. We maintain the same bargaining mechanism. Consumer 2
makes consumer 1 a take-it-or-leave-it offer $(h_1, T_1)$, where the subscript indicates that
consumer 1 has the property right in this situation. However, now, in the event that
consumer 1 rejects the offer, consumer 1 can choose to produce as much of the externality as she
wants, which means that she will choose to produce $h^*$. Thus the only change between
this situation and the first case is what happens in the event that no agreement is reached.
In this case, consumer 2's problem is:

$$\max_{h_1, T_1} \; \phi_2(h_1) - T_1$$

$$\text{s.t.} \quad \phi_1(h_1) + T_1 \geq \phi_1(h^*).$$

Again, we know that the constraint will bind, and so consumer 2 chooses $h_1$ and $T_1$ in
order to solve

$$\max_{h_1} \; \phi_1(h_1) + \phi_2(h_1) - \phi_1(h^*),$$

which is also maximized at $h_1 = h^{**}$, since the first-order condition is the same. The only
difference is in the transfer. Here $T_1 = \phi_1(h^*) - \phi_1(h^{**})$.

While both property-rights allocations implement $h^{**}$, they have different distributional
consequences. Consumer 2 receives a transfer in the case where consumer 2 has the property
rights, while consumer 2 pays a transfer when consumer 1 has the property rights.
The reason for this is that consumer 2 is in a better bargaining position when the
non-bargaining outcome is that consumer 1 is forced to produce 0 units of the externality
good.
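The two bargaining outcomes can be compared in a small worked example. The sketch below assumes hypothetical quadratic functions $\phi_1(h) = ah - h^2/2$ and $\phi_2(h) = -bh^2/2$ with $a = 4$ and $b = 1$; these functional forms and parameter values are illustrative only. With them, $\phi_1'(h^*) = 0$ gives $h^* = a$ and $\phi_1'(h^{**}) + \phi_2'(h^{**}) = 0$ gives $h^{**} = a/(1+b)$.

```python
A, B = 4.0, 1.0  # hypothetical parameters of the assumed quadratic forms


def phi1(h):
    """Consumer 1's benefit from activity h (strictly concave by assumption)."""
    return A * h - 0.5 * h * h


def phi2(h):
    """Consumer 2's (negative) benefit from activity h (strictly concave by assumption)."""
    return -0.5 * B * h * h


h_private = A            # solves phi1'(h) = A - h = 0
h_social = A / (1.0 + B)  # solves phi1'(h) + phi2'(h) = A - h - B*h = 0


def bargain(rights_holder):
    """Return (activity level, net transfer received by consumer 2).

    rights_holder = 2: consumer 2 may prohibit h and sells h_social for T_2.
    rights_holder = 1: consumer 1 may choose h freely and is paid T_1 to cut back.
    """
    if rights_holder == 2:
        t2 = phi1(h_social) - phi1(0.0)        # binding acceptance constraint
        return h_social, t2
    if rights_holder == 1:
        t1 = phi1(h_private) - phi1(h_social)  # binding acceptance constraint
        return h_social, -t1
    raise ValueError("rights_holder must be 1 or 2")


for holder in (2, 1):
    h, net_to_2 = bargain(holder)
    print(f"rights with consumer {holder}: h = {h}, net transfer to consumer 2 = {net_to_2}")
```

The activity level is the same under both assignments, while the sign of the net transfer to consumer 2 flips, which is the distributional difference described above.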

However, note that in the quasilinear framework, redistribution of the numeraire
commodity has no effect on social welfare. The fact that, regardless of how the property rights
are allocated, bargaining leads to a Pareto optimal allocation is an example of the Coase
Theorem: If trade of the externality can occur, then bargaining will lead to an efficient
outcome no matter how property rights are allocated (as long as they are clearly
allocated). Note that well-defined, enforceable property rights are essential for bargaining
to work. If there is a dispute over who has the right to pollute (or not pollute), then
bargaining may not lead to efficiency. An additional requirement for efficiency is that the
bargaining process itself is costless. Note that the government does not need to know about
individual consumers here; it only needs to define property rights. However, it is critical
that it do so clearly. Thus the Coase Theorem provides an argument in favor of having
clear laws and well-developed courts.

However, Hurwicz (Japan and the World Economy 7, 1995, pp. 49-74) argued that, 
even when the transaction cost is zero, absence of income effects in the demand for the 
good with externality is not only sufficient (which is well known) but also necessary for 
Coase Neutrality Theorem to be true, i.e., when the transaction cost is negligible, the 
level of pollution will be independent of the assignments of property rights if and only if 
preferences of the consumers are quasi-linear with respect to the private good, leading to 
absence of income effects in the demand for the good with externality. 

Unfortunately, as shown by Chipman and Tian (2012, Economic Theory), the proof
of Hurwicz's claim on the necessity of parallel preferences for "Coase's conjecture" is
incorrect. To see this, consider the following class of utility functions $U_i(x_i, h)$ that have
the functional form:

$$U_i(x_i, h) = x_i e^{-h} + \phi_i(h), \quad i = 1, 2 \tag{11.36}$$

where

$$\phi_i(h) = \int e^{-h} b_i(h) \, dh. \tag{11.37}$$

$U_i(x_i, h)$ is then clearly not quasi-linear in $x_i$. It is further assumed that for all $h \in (0, \eta]$,
$b_1(h) > \xi$, $b_2(h) < 0$, $b_i'(h) < 0$ $(i = 1, 2)$, $b_1(0) + b_2(0) \geq \xi$, and $b_1(\eta) + b_2(\eta) \leq \xi$.
We then have

$$\partial U_i / \partial x_i = e^{-h} > 0, \quad i = 1, 2,$$

$$\partial U_1 / \partial h = -x_1 e^{-h} + b_1(h) e^{-h} > e^{-h} [\xi - x_1] > 0,$$

$$\partial U_2 / \partial h = -x_2 e^{-h} + b_2(h) e^{-h} < 0$$

for $(x_i, h) \in (0, \xi) \times (0, \eta)$, $i = 1, 2$. Thus, by the mutual tangency equality condition for
Pareto efficiency, we have

$$0 = \frac{\partial U_1 / \partial h}{\partial U_1 / \partial x_1} + \frac{\partial U_2 / \partial h}{\partial U_2 / \partial x_2} = -x_1 - x_2 + b_1(h) + b_2(h) = b_1(h) + b_2(h) - \xi, \tag{11.38}$$

which is independent of $x_i$. Hence, if $(x_1, x_2, h)$ is Pareto optimal, so is $(x_1', x_2', h)$ provided
$x_1 + x_2 = x_1' + x_2' = \xi$. Also, note that $b_i'(h) < 0$ $(i = 1, 2)$, $b_1(0) + b_2(0) \geq \xi$, and
$b_1(\eta) + b_2(\eta) \leq \xi$. Then $b_1(h) + b_2(h)$ is strictly monotone and thus there is a unique
$h \in [0, \eta]$ satisfying (11.38). Thus, the contract curve is horizontal even though individuals'
preferences need not be parallel.
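The independence of the efficient h from the split of the private good can be verified numerically. The sketch below assumes hypothetical values $\xi = 1$, $\eta = 1$ and hypothetical linear functions $b_1(h) = \xi + 1 - h/2$ and $b_2(h) = -2h$, which satisfy the sign, monotonicity, and boundary assumptions stated above; none of these choices comes from the text. It solves the tangency condition (11.38) by bisection for several divisions of $\xi$.

```python
XI = 1.0   # hypothetical total endowment of the private good
ETA = 1.0  # hypothetical upper bound on the activity level h


def b1(h):
    """Assumed b_1: greater than XI and decreasing on (0, ETA]."""
    return XI + 1.0 - 0.5 * h


def b2(h):
    """Assumed b_2: negative and decreasing on (0, ETA]."""
    return -2.0 * h


def tangency_residual(x1, h):
    """Sum of (dU_i/dh)/(dU_i/dx_i) = -x_i + b_i(h) over both consumers, with x2 = XI - x1."""
    x2 = XI - x1
    return (-x1 + b1(h)) + (-x2 + b2(h))


def efficient_h(x1, tol=1e-12):
    """Solve tangency_residual(x1, h) = 0 on [0, ETA]; the residual is decreasing in h."""
    lo, hi = 0.0, ETA
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if tangency_residual(x1, mid) > 0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


for x1 in (0.1, 0.5, 0.9):
    print(f"x1 = {x1}: efficient h = {efficient_h(x1):.6f}")
```

Every split of the private good yields the same efficient h, so the contract curve in $(x, h)$ space is horizontal even though the assumed utility is not quasi-linear in $x_i$.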



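Example 11.4.2 Suppose $b_1(h) = (1 + h)^{\alpha} \eta^{\alpha} + \xi$ with $\alpha < 0$, and $b_2(h) = -h^{\eta}$. Then, for
all $h \in (0, \eta]$, $b_1(h) > \xi$, $b_2(h) < 0$, $b_i'(h) < 0$ $(i = 1, 2)$, $b_1(0) + b_2(0) \geq \xi$, and $b_1(\eta) + b_2(\eta) \leq \xi$.
Thus, $\phi_i(h) = \int e^{-h} b_i(h) \, dh$ is concave, and $U_i(x_i, h) = x_i e^{-h} + \int e^{-h} b_i(h) \, dh$ is
quasi-concave, with $\partial U_i / \partial x_i > 0$, $\partial U_1 / \partial h > 0$, and $\partial U_2 / \partial h < 0$ for $(x_i, h) \in (0, \xi) \times (0, \eta)$,
$i = 1, 2$, but it is not quasi-linear in $x_i$.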

Chipman and Tian (2012) then investigate the necessity for the "Coase conjecture"
that the level of pollution is independent of the assignments of property rights. This
reduces to developing the necessary and sufficient conditions that guarantee that the
contract curve is horizontal so that the set of Pareto optima for the utility functions
is h-constant. This in turn reduces to finding the class of utility functions such that
the mutual tangency (first-order) condition does not contain $x_i$ and consequently it is a
function, denoted by g(h), of h only:

$$\frac{\partial U_1 / \partial h}{\partial U_1 / \partial x_1} + \frac{\partial U_2 / \partial h}{\partial U_2 / \partial x_2} = g(h) = 0. \tag{11.39}$$

Let $F_i(x_i, h) = \frac{\partial U_i / \partial h}{\partial U_i / \partial x_i}$ $(i = 1, 2)$, which can be generally expressed as

$$F_i(x_i, h) = x_i \psi_i(h) + f_i(x_i, h) + b_i(h),$$

where $f_i(x_i, h)$ are nonseparable and nonlinear in $x_i$. $\psi_i(h)$, $b_i(h)$, and $f_i(x_i, h)$ will be
further specified below.

Let $F(x, h) = F_1(x, h) + F_2(\xi - x, h)$. Then the mutual tangency equality condition
can be rewritten as

$$F(x, h) = 0. \tag{11.40}$$

Thus, the contract curve, i.e., the locus of Pareto-optimal allocations, can be expressed
by a function $h = f(x)$ that is implicitly defined by (11.40).

Then, the Coase Neutrality Theorem, which is characterized by the condition that the
set of Pareto optima (the contract curve) in the $(x,h)$ space for $x_i > 0$ is a horizontal line
$h$ = constant, implies that

$$h = f(x) = \bar{h}$$

with $\bar{h}$ constant, and thus we have

$$\frac{dh}{dx} = -\frac{F_x}{F_h} = 0$$

for all $x \in [0,\xi]$ and $F_h \neq 0$, which means that the function $F(x,h)$ is independent of $x$.
Then, for all $x \in [0,\xi]$,

$$F(x,h) = x\psi_1(h) + (\xi - x)\psi_2(h) + f_1(x,h) + f_2(\xi - x,h) + b_1(h) + b_2(h) \equiv g(h). \tag{11.41}$$

Since the utility functions $U_1$ and $U_2$ are functionally independent, and $x$ disappears
in (11.41), we must have $\psi_1(h) = \psi_2(h) \equiv \psi(h)$ and $f_1(x,h) = -f_2(\xi - x,h) \equiv 0$ for all
$x \in [0,\xi]$. Therefore,

$$F(x,h) \equiv \xi\psi(h) + b_1(h) + b_2(h) \equiv g(h), \tag{11.42}$$

and

$$\frac{\partial U_i/\partial h}{\partial U_i/\partial x_i} = F_i(x_i,h) = x_i\psi(h) + b_i(h), \tag{11.43}$$

which is a first-order linear partial differential equation. Then, from Polyanin, Zaitsev,
and Moussiaux (2002) (see footnote 3), we know that the principal integral $U_i(x_i,h)$ of (11.43)
is given by

$$U_i(x_i,h) = x_i e^{\int\psi(h)dh} + \phi_i(h), \quad i = 1,2 \tag{11.44}$$

with

$$\phi_i(h) = \int e^{\int\psi(h)dh}\, b_i(h)\,dh. \tag{11.45}$$

The general solution of (11.43) is then given by $\bar{U}_i(x_i,h) = \Psi(U_i)$, where $\Psi$ is an arbitrary
function. Since a monotonic transformation preserves orderings of preferences, we can
regard the principal solution $U_i(x_i,h)$ as a general functional form of utility functions
that is fully characterized by (11.43).

Note that (11.44) is a general utility function that contains quasi-linear utility in $x_i$
and the utility function given in (11.36) as special cases. Indeed, it represents parallel
preferences when $\psi(h) \equiv 0$ and also reduces to the utility function given by (11.36) when
$\psi(h) = -1$.
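
As a check, one can differentiate the principal integral (11.44)-(11.45) directly and confirm that it satisfies (11.43). The shorthand $E(h) = e^{\int\psi(h)dh}$, with $E'(h) = \psi(h)E(h)$, is introduced here only for this verification:

```latex
\begin{align*}
\frac{\partial U_i}{\partial x_i} &= E(h), \\
\frac{\partial U_i}{\partial h}   &= x_i\,\psi(h)\,E(h) + \phi_i'(h)
                                    = x_i\,\psi(h)\,E(h) + E(h)\,b_i(h), \\
\frac{\partial U_i/\partial h}{\partial U_i/\partial x_i} &= x_i\,\psi(h) + b_i(h).
\end{align*}
```

Setting $\psi(h) \equiv -1$ gives $E(h) = e^{-h}$ (up to a positive constant), which is the utility function of Example 11.4.1, while $\psi(h) \equiv 0$ gives the quasi-linear case $U_i = x_i + \int b_i(h)\,dh$.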

Footnote 3: It can also be seen from http://eqworld.ipmnet.ru/en/solutions/fpde/fpdell04.pdf.

To make the mutual tangency (first-order) condition (11.39) also sufficient
for the contract curve to be horizontal in a pollution economy, we assume that for all
$h \in (0,\eta]$, $x_1\psi(h) + b_1(h) > 0$, $x_2\psi(h) + b_2(h) < 0$, $\psi'(h) < 0$, $b_i'(h) < 0$ ($i = 1,2$),
$\xi\psi(0) + b_1(0) + b_2(0) > 0$, and $\xi\psi(\eta) + b_1(\eta) + b_2(\eta) < 0$.
We then have for $(x_i,h) \in (0,\xi) \times (0,\eta)$, $i = 1,2$,



$\partial U_i/\partial x_i = e^{\int\psi(h)dh} > 0$, $i = 1,2$,
$\partial U_1/\partial h = e^{\int\psi(h)dh}\,[x_1\psi(h) + b_1(h)] > 0$,
$\partial U_2/\partial h = e^{\int\psi(h)dh}\,[x_2\psi(h) + b_2(h)] < 0$,

and thus

$$0 = \frac{\partial U_1/\partial h}{\partial U_1/\partial x_1} + \frac{\partial U_2/\partial h}{\partial U_2/\partial x_2} = (x_1 + x_2)\psi(h) + b_1(h) + b_2(h) = \xi\psi(h) + b_1(h) + b_2(h), \tag{11.46}$$

which does not contain $x_i$. Hence, if $(x_1, x_2, h)$ is Pareto optimal, so is $(x_1', x_2', h)$ provided
$x_1 + x_2 = x_1' + x_2' = \xi$. Also, note that $\psi'(h) < 0$, $b_i'(h) < 0$ ($i = 1,2$),
$\xi\psi(0) + b_1(0) + b_2(0) > 0$, and $\xi\psi(\eta) + b_1(\eta) + b_2(\eta) < 0$. Then $\xi\psi(h) + b_1(h) + b_2(h)$ is
strictly monotone and thus there is a unique $h \in [0,\eta]$ that satisfies (11.46). Thus, the contract
curve is horizontal even though individuals' preferences need not be parallel.

The formal statement of the Coase Neutrality Theorem obtained by Chipman and Tian
(2012) thus can be set forth as follows:

Proposition 11.4.1 (Coase Neutrality Theorem) In a pollution economy considered
in the chapter, suppose that the transaction cost equals zero, and that the utility functions
$U_i(x_i,h)$ are differentiable and such that $\partial U_i/\partial x_i > 0$, and $\partial U_1/\partial h > 0$ but
$\partial U_2/\partial h < 0$ for $(x_i,h) \in (0,\xi) \times (0,\eta)$, $i = 1,2$. Then, the level of pollution is
independent of the assignments of property rights if and only if the utility functions
$U_i(x_i,h)$, up to a monotonic transformation, have a functional form given by

$$U_i(x_i,h) = x_i e^{\int\psi(h)dh} + \int e^{\int\psi(h)dh}\, b_i(h)\,dh, \tag{11.47}$$

where $\psi$ and $b_i$ are arbitrary functions such that the $U_i(x_i,h)$ are differentiable,
$\partial U_i/\partial x_i > 0$, and $\partial U_1/\partial h > 0$ but $\partial U_2/\partial h < 0$ for
$(x_i,h) \in (0,\xi) \times (0,\eta)$, $i = 1,2$.

Although the above Coase neutrality theorem covers a wider class of preferences than
quasi-linear utility functions, it still puts a significant restriction on the domain of its
validity due to the special functional forms of the utility functions with respect to the
private good. Thus, the Coase Neutrality Theorem may be best applied to production
externalities between firms rather than to consumption externalities.

The problem with this Coase theorem is that the costs of negotiation and organization
are, in general, not negligible, and the income effect may not be zero. Thus, privatization is
optimal only in the case of zero transaction cost, no income effect, and perfect economic
environments. Tian (2000, 2001) showed that state ownership and collective ownership
can also be optimal when an economic environment is sufficiently imperfect or somewhere
in the middle.

The problem with the Coase Efficiency Theorem is more serious. First, as Arrow (1979,
p. 24) pointed out, the basic postulate underlying Coase's theory appears to be that
the process of negotiation over property rights can be modelled as a cooperative game,
and this requires the assumption that each player knows the preferences or production
functions of each of the other players. When information is incomplete or asymmetric,
in general we do not have a Pareto optimal outcome. For instance, when there is one
polluter and there are many pollutees, a "free-rider" problem arises and there is an
incentive for pollutees to misrepresent their preferences. Whether the polluter is liable
or not, the pollutees may be expected to overstate the amount they require to compensate
for the externality. Thus, we may need to design an incentive mechanism to solve the
free-rider problem.

Secondly, even if the information is complete, there are several circumstances that have
led a number of authors to question the conclusion in the Coase Efficiency Theorem:

(1) The core may be empty, and hence no Pareto optimum exists. An example
of this for a three-agent model was presented by Aivazian and Callen
(1981).

(2) There may be a fundamental non-convexity that prevents a Pareto optimum
from being supported by a competitive equilibrium. Starrett (1972)
showed that externalities are characterized by "fundamental non-convexities"
that may preclude existence of competitive equilibrium.

(3) When an agent possesses the right to pollute, there is a built-in incentive
for extortion. As Andel (1966) has pointed out, anyone with the right to
pollute has an incentive to extract payment from potential pollutees, e.g.,
by threatening to blow a bugle in the middle of the night.

Thus, the hypothesis that negotiations over externalities will mimic trades in a competitive
equilibrium is, as Coase himself has conceded, not one that can be logically derived
from his assumptions, but must be regarded as an empirical conjecture that may or may
not be confirmed from the data. A lot of theoretical work therefore still remains in order
to provide Coasian economics with a rigorous underpinning.

11.4.3 Missing Market 

We can regard an externality as the lack of a market for the "externality." In the above
example on Pigovian taxes, the missing market is a market for pollution. Adding a market
in which firm 2 can express its demand for pollution (or for a reduction of pollution) will
provide a mechanism for efficient allocations. By adding this market, firm 1 can decide how
much pollution it wants to sell, and firm 2 can decide how much pollution it wants to buy.

Let $r$ be the price of pollution.

$x_1$ = the units of pollution that firm 1 wants to sell;
$x_2$ = the units of pollution that firm 2 wants to buy.
Normalize the output of firm 1 to $x_1$.

The profit maximization problems become:

$\pi_1 = p_x x_1 + r x_1 - c_1(x_1)$

$\pi_2 = p_y y - r x_2 - e(x_2) - c_y(y)$

The first order conditions are:

$p_x + r = c_1'(x_1)$ for Firm 1
$p_y = c_y'(y)$ for Firm 2
$-r = e'(x_2)$ for Firm 2.

At the market equilibrium, $x_1^* = x_2^* = x^*$, we have

$$p_x = c_1'(x^*) + e'(x^*) \tag{11.48}$$

which results in a socially optimal outcome.
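
A minimal numerical sketch of the missing-market equilibrium follows. It assumes quadratic cost and damage functions, $c_1(x) = a x^2/2$ and $e(x) = d x^2/2$. These functional forms, the parameter values, and the function name are illustrative assumptions, not part of the text.

```python
def missing_market_equilibrium(p_x, a, d):
    """Solve p_x + r = c_1'(x) and -r = e'(x) with c_1'(x) = a*x and e'(x) = d*x.

    Returns the traded pollution level x_star and the pollution price r.
    """
    x_star = p_x / (a + d)   # from p_x = c_1'(x) + e'(x) = (a + d) * x
    r = -d * x_star          # firm 2's condition: -r = e'(x)
    return x_star, r

p_x, a, d = 10.0, 1.0, 4.0   # hypothetical values
x_star, r = missing_market_equilibrium(p_x, a, d)

# Firm 1's first order condition holds at the equilibrium price r ...
foc_firm1 = p_x + r - a * x_star
# ... and so does the social optimality condition (11.48).
social_gap = p_x - (a * x_star + d * x_star)
# Without the market, firm 1 would ignore the damage and set p_x = c_1'(x).
x_private = p_x / a
print(x_star, r, x_private)
```

With these numbers $r$ is negative, so firm 1 in effect pays firm 2 for each unit of pollution, and output falls from the privately chosen level to the socially optimal one.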

11.4.4 The Compensation Mechanism

Pigovian taxes are in general not adequate to solve externalities because of the information
problem: the tax authority cannot know the cost imposed by the externality. How
can one solve this incomplete information problem?

Varian (AER 1994) proposed an incentive mechanism which encourages the firms to
correctly reveal the costs they impose on each other. Here, we discuss this mechanism. In
brief, a mechanism consists of a message space and an outcome function (rules of the game).
We will introduce mechanism design theory in detail in Part V.

Strategy Space (Message Space): $M = M_1 \times M_2$ with $M_1 = \{(t_1,x_1)\}$ and $M_2 = \{(t_2,y_2)\}$,
where $t_1$ is interpreted as a Pigovian tax proposed by firm 1 and $x_1$ is the proposed level of
output by firm 1, and $t_2$ is interpreted as a Pigovian tax proposed by firm 2 and $y_2$ is the
proposed level of output by firm 2.

The mechanism has two stages:

Stage 1: (Announcement stage): Firms 1 and 2 name Pigovian tax rates, $t_i$, $i = 1,2$,
which may or may not be the efficient level of such a tax rate.

Stage 2: (Choice stage): If firm 1 produces $x$ units of pollution, firm 1 must pay $t_2 x$
to firm 2. Thus, each firm takes the tax rate as given. Firm 2 receives $t_1 x$ units as
compensation. Each firm pays a penalty, $(t_1 - t_2)^2$, if they announce different tax rates.

Thus, the payoffs of the two firms are:

$$\pi_1^* = \max_x\; p_x x - c_x(x) - t_2 x - (t_1 - t_2)^2$$

$$\pi_2^* = \max_y\; p_y y - c_y(y) + t_1 x - e(x) - (t_1 - t_2)^2.$$

Because this is a two-stage game, we may use the subgame perfect equilibrium, i.e., an
equilibrium in which each firm takes into account the repercussions of its first-stage choice
on the outcomes in the second stage. As usual, we solve this game by looking at stage 2
first.

At stage 2, firm 1 will choose $x(t_2)$ to satisfy the first order condition:

$$p_x - c_x'(x) - t_2 = 0. \tag{11.49}$$

Note that, by the convexity of $c_x$, i.e., $c_x''(x) > 0$, we have

$$x'(t_2) = -\frac{1}{c_x''(x)} < 0. \tag{11.50}$$

Firm 2 will choose $y$ to satisfy $p_y = c_y'(y)$.

Stage 1: Each firm will choose the tax rate $t_1$ and $t_2$ to maximize their payoffs.
For Firm 1,

$$\max_{t_1}\; p_x x - c_x(x) - t_2 x(t_2) - (t_1 - t_2)^2 \tag{11.51}$$

which gives us the first order condition:

$$2(t_1 - t_2) = 0$$

so the optimal solution is

$$t_1^* = t_2. \tag{11.52}$$

For Firm 2,

$$\max_{t_2}\; p_y y - c_y(y) + t_1 x(t_2) - e(x(t_2)) - (t_1 - t_2)^2 \tag{11.53}$$

so that the first order condition is

$$t_1 x'(t_2) - e'(x(t_2))\,x'(t_2) + 2(t_1 - t_2) = 0$$

and thus

$$[t_1 - e'(x(t_2))]\,x'(t_2) + 2(t_1 - t_2) = 0. \tag{11.54}$$

By (11.50), (11.52) and (11.54), we have

$$t^* = e'(x(t^*)) \quad \text{with } t^* = t_1^* = t_2^*. \tag{11.55}$$

Substituting the equilibrium tax rate, $t^* = e'(x(t^*))$, into (11.49) we have

$$p_x = c_x'(x^*) + e'(x^*) \tag{11.56}$$

which is the condition for social efficiency of production.
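
The backward-induction argument can be traced numerically. The sketch below assumes quadratic forms $c_x(x) = a x^2/2$ and $e(x) = d x^2/2$. These forms, the parameter values, and the function names are illustrative assumptions rather than part of Varian's mechanism. It computes the equilibrium tax from (11.55), checks firm 2's first order condition (11.54) by a finite difference, and confirms the efficiency condition (11.56).

```python
def x_of_t(t2, p_x, a):
    """Stage 2: firm 1's output from (11.49), p_x - c_x'(x) - t2 = 0, with c_x'(x) = a*x."""
    return (p_x - t2) / a

def firm2_payoff(t1, t2, p_x, a, d):
    """Firm 2's stage-1 objective (11.53), dropping the constant p_y*y - c_y(y)."""
    x = x_of_t(t2, p_x, a)
    return t1 * x - 0.5 * d * x ** 2 - (t1 - t2) ** 2

def equilibrium_tax(p_x, a, d):
    """Solve t = e'(x(t)) = d * (p_x - t) / a, i.e. condition (11.55)."""
    return d * p_x / (a + d)

p_x, a, d = 10.0, 1.0, 4.0          # hypothetical values
t_star = equilibrium_tax(p_x, a, d)
x_star = x_of_t(t_star, p_x, a)

# Check firm 2's first order condition numerically at t1 = t2 = t_star.
eps = 1e-6
dpayoff = (firm2_payoff(t_star, t_star + eps, p_x, a, d)
           - firm2_payoff(t_star, t_star - eps, p_x, a, d)) / (2 * eps)

# Social efficiency (11.56): p_x = c_x'(x*) + e'(x*).
efficiency_gap = p_x - (a * x_star + d * x_star)
print(t_star, x_star, dpayoff, efficiency_gap)
```

Under these assumptions the equilibrium tax equals the marginal damage at the efficient output, which is the Pigovian rate the tax authority could not compute on its own.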

Remark 11.4.1 This mechanism works by setting opposing incentives for two agents.
Firm 1 always has an incentive to match the announcement of firm 2. But consider firm
2's incentive. If firm 2 thinks that firm 1 will propose a large compensation rate $t_1$ for
it, it wants firm 1 to be taxed as little as possible so that firm 1 will produce as much
as possible. On the other hand, if firm 2 thinks firm 1 will propose a small $t_1$, it wants
firm 1 to be taxed as much as possible. Thus, the only point where firm 2 is indifferent
about the level of production of firm 1 is where firm 2 is exactly compensated for the cost
of the externality.

In general, the individual's objective is different from the social goal. However, we may
be able to construct an appropriate mechanism so that the individual's profit-maximizing
goal is consistent with the social goal, such as efficient allocations. Tian (2003) also gave the
solution to consumption externalities by giving an incentive mechanism that results
in Pareto efficient allocations. Tian (2004) studies the informational efficiency problem of
the mechanisms that result in Pareto efficient allocations for consumption externalities.

Chapter 12
Public Goods



12.1 Introduction 

A pure public good is a good in which consuming one unit of the good by an individual 
in no way prevents others from consuming the same unit of the good. Thus, the good is 
nonexcludable and non-rival. 

A good is excludable if people can be excluded from consuming it. A good is non-rival 
if one person's consumption does not reduce the amount available to other consumers. 

Examples of Public Goods: street lights, policemen, fire protection, highway system, 
national defence, flood-control project, public television and radio broadcast, public parks, 
and a public project. 

Local Public Goods: when there is a location restriction for the service of a public 
good. 

Even though the competitive market is an efficient social institution for allocating
private goods, it turns out that a private market is not a very good
mechanism for allocating public goods.

12.2 Notations and Basic Settings 

Consider a general setting of a public goods economy that includes consumers, producers,
private goods, and public goods.
Let
n: the number of consumers.
L: the number of private goods.
K: the number of public goods.

$Z_i \subseteq \mathbb{R}_+^L \times \mathbb{R}_+^K$: the consumption space of consumer $i$.
$Z \subseteq \mathbb{R}_+^{nL} \times \mathbb{R}_+^K$: consumption space.
$x_i \in \mathbb{R}_+^L$: a consumption of private goods by consumer $i$.
$y \in \mathbb{R}_+^K$: a consumption/production of public goods.

$w_i \in \mathbb{R}_+^L$: the initial endowment of private goods for consumer $i$. For simplicity,
it is assumed that there is no public goods endowment, but they can be
produced from private goods by a firm.

$v \in \mathbb{R}_+^L$: the private goods input. For simplicity, assume there is only one firm
to produce the public goods.

$f: \mathbb{R}_+^L \to \mathbb{R}_+^K$: production function with $y = f(v)$.

$\theta_i$: the profit share of consumer $i$ from the production.

$(x_i,y) \in Z_i$.

$(x,y) = (x_1, \ldots, x_n, y) \in Z$: an allocation.

$\succsim_i$ (or $u_i$ if exists) is a preference ordering.

$e_i = (Z_i, \succsim_i, w_i, \theta_i)$: the characteristic of consumer $i$.

An allocation $z \equiv (x,y)$ is feasible if

$$\sum_{i=1}^n x_i + v \leq \sum_{i=1}^n w_i \tag{12.1}$$

and

$$y = f(v) \tag{12.2}$$

$e = (e_1, \ldots, e_n, f)$: a public goods economy.

An allocation $(x,y)$ is Pareto efficient for a public goods economy $e$ if it is feasible and
there is no other feasible allocation $(x',y')$ such that $(x_i',y') \succsim_i (x_i,y)$ for all consumers $i$
and $(x_k',y') \succ_k (x_k,y)$ for some $k$.

An allocation $(x,y)$ is weakly Pareto efficient for the public goods economy $e$ if it is
feasible and there is no other feasible allocation $(x',y')$ such that $(x_i',y') \succ_i (x_i,y)$ for all
consumers $i$.

Remark 12.2.1 Unlike in private goods economies, even under the assumptions
of continuity and strict monotonicity, a weakly Pareto efficient allocation may not be
Pareto efficient for public goods economies. The following proposition is due to Tian
(Economics Letters, 1988).

Proposition 12.2.1 For the public goods economies, a weakly Pareto efficient allocation
may not be Pareto efficient even if preferences satisfy strict monotonicity and continuity.

Proof: The proof is by way of an example. Consider an economy with $(n,L,K) =
(3,1,1)$, constant returns in producing $y$ from $x$ (the input-output coefficient normalized
to one), and the following endowments and utility functions: $w_1 = w_2 = w_3 = 1$,
$u_1(x_1,y) = x_1 + y$, and $u_i(x_i,y) = x_i + 2y$ for $i = 2,3$. Then $z = (x,y)$ with $x = (0.5,0,0)$
and $y = 2.5$ is weakly Pareto efficient, since $x_1 + y \leq 3 = u_1(z)$ for every feasible allocation
and so consumer 1 can never be made strictly better off. It is not Pareto efficient, however,
because $z' = (x',y') = (0,0,0,3)$ Pareto-dominates $z$: consumer 1 is equally well off while
consumers 2 and 3 are strictly better off.
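
The example in the proof can be checked mechanically. The sketch below uses exact rational arithmetic; the grid search is only a finite check (not a proof) that no feasible allocation makes all three consumers strictly better off than at $z$. All function and variable names are illustrative.

```python
from fractions import Fraction as Fr

def utilities(x, y):
    """u_1 = x_1 + y and u_i = x_i + 2y for i = 2, 3, as in the proof."""
    return (x[0] + y, x[1] + 2 * y, x[2] + 2 * y)

TOTAL = Fr(3)  # w_1 + w_2 + w_3; one unit of input yields one unit of y

z = ((Fr(1, 2), Fr(0), Fr(0)), Fr(5, 2))
z_prime = ((Fr(0), Fr(0), Fr(0)), Fr(3))
u_z = utilities(*z)
u_zp = utilities(*z_prime)

# z' makes nobody worse off and at least one consumer strictly better off.
dominated = (all(b >= a for a, b in zip(u_z, u_zp))
             and any(b > a for a, b in zip(u_z, u_zp)))

def strict_improvement_exists(steps=30):
    """Search a grid of feasible allocations for one that strictly improves on z for all."""
    grid = [TOTAL * k / steps for k in range(steps + 1)]
    for x1 in grid:
        for x2 in grid:
            for x3 in grid:
                y = TOTAL - x1 - x2 - x3   # use all resources; waste cannot help
                if y < 0:
                    continue
                u = utilities((x1, x2, x3), y)
                if all(b > a for a, b in zip(u_z, u)):
                    return True
    return False

strict_improvement_found = strict_improvement_exists()
print(u_z, u_zp, dominated, strict_improvement_found)
```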

However, under an additional condition of strict convexity, they are equivalent. The 
proof is left to readers. 

12.3 Discrete Public Goods 

12.3.1 Efficient Provision of Public Goods 

For simplicity, consider a public good economy with n consumers and two goods: one 
private good and one public good. 

Let $g_i$ be the contribution made by consumer $i$, so that

$$x_i + g_i = w_i$$

$$\sum_{i=1}^n g_i = v$$

Assume $u_i(x_i,y)$ is strictly monotonic increasing and continuous.
Let $c$ be the cost of producing the public project so that the production technology is
given by

$$y = \begin{cases} 1 & \text{if } \sum_{i=1}^n g_i \geq c \\ 0 & \text{otherwise} \end{cases}$$

We first want to know under what conditions providing the public good will Pareto
dominate not producing it, i.e., there exists $(g_1, \ldots, g_n)$ such that $\sum_{i=1}^n g_i = c$ and

$$u_i(w_i - g_i, 1) > u_i(w_i, 0) \quad \forall i$$

Let $r_i$ be the maximum willingness-to-pay (reservation price) of consumer $i$, i.e., $r_i$
must satisfy

$$u_i(w_i - r_i, 1) = u_i(w_i, 0). \tag{12.3}$$

If producing the public project Pareto dominates not producing the public project, we
have

$$u_i(w_i - g_i, 1) > u_i(w_i, 0) = u_i(w_i - r_i, 1) \quad \text{for all } i \tag{12.4}$$

By monotonicity of $u_i$, we have

$$w_i - g_i > w_i - r_i \quad \text{for all } i \tag{12.5}$$

Then, we have

$$r_i > g_i \tag{12.6}$$

and thus

$$\sum_{i=1}^n r_i > \sum_{i=1}^n g_i \geq c \tag{12.7}$$

That is, the sum of the willingness-to-pay for the public good must exceed the cost of
providing it. This condition is necessary. In fact, this condition is also sufficient. In
summary, we have the following proposition.

Proposition 12.3.1 Providing a public good Pareto dominates not producing the public
good if and only if $\sum_{i=1}^n r_i > \sum_{i=1}^n g_i \geq c$.
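
The sufficiency direction is stated above without proof. One way to see it (a sketch, not necessarily the author's argument) is to charge each consumer in proportion to her reservation price, $g_i = r_i c / \sum_j r_j$. Then $\sum_i g_i = c$, and $g_i < r_i$ whenever $r_i > 0$ and $\sum_j r_j > c$, so by monotonicity every such consumer strictly prefers provision. The function name and the numbers below are illustrative.

```python
def proportional_contributions(r, c):
    """Split the cost c in proportion to reservation prices r_i (one possible choice)."""
    total = sum(r)
    if total <= c:
        raise ValueError("sum of willingness-to-pay does not exceed the cost")
    return [ri * c / total for ri in r]

r = [90.0, 30.0, 30.0]   # hypothetical reservation prices
c = 99.0                 # hypothetical cost
g = proportional_contributions(r, c)

# Every consumer pays strictly less than her reservation price,
# and the contributions exactly cover the cost.
all_better_off = all(gi < ri for gi, ri in zip(g, r))
print(g, sum(g), all_better_off)
```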

12.3.2 Free-Rider Problem 

How effective is a private market at providing public goods? The answer, as shown below,
is that we cannot expect that purely independent decisions will necessarily result in an
efficient amount of the public good being produced. To see this, suppose

$r_i = 100$, $i = 1,2$

$c = 150$ (total cost)

$g_i = 150/2 = 75$ if both agents make contributions, and $g_i = 150$ if only agent $i$ makes a contribution.

Each person decides independently whether or not to buy the public good. As a result,
each one has an incentive to be a free-rider on the other, as shown in the following payoff
matrix.
                                 Person 2
                          Buy            Doesn't Buy
Person 1   Buy            (25, 25)       (-50, 100)
           Doesn't Buy    (100, -50)     (0, 0)

Table 12.3.2: Private Provision of a Discrete Public Good

Note that net payoffs are defined by $r_i - g_i$. Thus, a person's net payoff is 100 - 150/2 = 25
when both consumers are willing to produce the public project, and 100 - 150 = -50 when
that person buys but the other does not (the free-rider then gets 100).

This is the prisoner's dilemma. The dominant strategy equilibrium in this game is
(doesn't buy, doesn't buy). Thus, nobody wants to share the cost of producing the
public project, but each wants to free-ride on the other consumer. As a result, the public
good is not provided at all even though it would be efficient to do so. Thus, voluntary
contribution in general does not result in the efficient level of the public good.
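
The payoff matrix and the dominant strategy can be reproduced with a few lines of code. The sketch below uses the numbers from the example ($r_i = 100$, $c = 150$); the function and constant names are illustrative.

```python
R, C = 100, 150                      # reservation price of each person and total cost
BUY, NOT = "Buy", "Doesn't Buy"

def payoff(own, other):
    """Net payoff r_i - g_i of one person given both actions."""
    if own == BUY and other == BUY:
        return R - C / 2             # the cost is shared
    if own == BUY:
        return R - C                 # pays the whole cost alone
    if other == BUY:
        return R                     # free-rides on the other person
    return 0                         # the good is not provided

# (payoff of person 1, payoff of person 2) for each pair of actions.
matrix = {(s1, s2): (payoff(s1, s2), payoff(s2, s1))
          for s1 in (BUY, NOT) for s2 in (BUY, NOT)}

def dominant_strategy():
    """Return the action that is strictly best against every action of the other, if any."""
    for s in (BUY, NOT):
        alternative = NOT if s == BUY else BUY
        if all(payoff(s, t) > payoff(alternative, t) for t in (BUY, NOT)):
            return s
    return None

print(matrix, dominant_strategy())
```

Both players share the same dominant strategy, so the unique dominant strategy equilibrium yields (0, 0) even though (25, 25) is feasible.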

12.3.3 Voting for a Discrete Public Good 

The amount of a public good is often determined by voting. Will this generally result
in an efficient provision? The answer is no, as the following example shows.

Example 12.3.1

$c = 99$
$r_1 = 90$, $r_2 = 30$, $r_3 = 30$

Clearly, $r_1 + r_2 + r_3 > c$, and with equal cost shares $g_i = 99/3 = 33$. So the efficient decision
is to provide the public good. However, under the majority rule, only consumer 1 votes "yes" since
she receives a positive net benefit if the good is provided. The 2nd and 3rd persons vote "no"
to producing the public good, and therefore the public good will not be provided, so that we
have inefficient provision of the public good. The problem with the majority rule is that it
only registers whether each voter's net benefit is positive, whereas the efficiency condition
requires a comparison of the total willingness-to-pay with the cost.
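
The gap between the voting outcome and the efficiency test can be made explicit. In the sketch below the first case uses the numbers of Example 12.3.1; the second case (an inefficient project that nevertheless passes) is a hypothetical addition for contrast. Function names are illustrative, and equal cost shares $c/n$ are assumed as in the example.

```python
def efficient_to_provide(r, c):
    """Efficiency test of Proposition 12.3.1: total willingness-to-pay exceeds the cost."""
    return sum(r) > c

def majority_approves(r, c):
    """Each voter says yes iff r_i exceeds the equal cost share c/n; simple majority decides."""
    share = c / len(r)
    yes_votes = sum(1 for ri in r if ri > share)
    return yes_votes > len(r) / 2

# Example 12.3.1: efficient to provide, but voted down.
r_text, c_text = [90, 30, 30], 99
# Hypothetical reverse case: not efficient, but approved by two of three voters.
r_hyp, c_hyp = [40, 40, 0], 99

print(efficient_to_provide(r_text, c_text), majority_approves(r_text, c_text))
print(efficient_to_provide(r_hyp, c_hyp), majority_approves(r_hyp, c_hyp))
```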



12.4 Continuous Public Goods 
12.4.1 Efficient Provision of Public Goods 

Again, for simplicity, we assume there is only one public good and one private good that 
may be regarded as money, and y = f(v). 

The welfare maximization approach shows that Pareto efficient allocations can be
characterized by

$$\max \sum_{i=1}^n a_i u_i(x_i,y)$$

$$\text{s.t.} \quad \sum_{i=1}^n x_i + v \leq \sum_{i=1}^n w_i$$

$$y \leq f(v)$$

Define the Lagrangian function:

$$L = \sum_{i=1}^n a_i u_i(x_i,y) + \lambda\Big(\sum_{i=1}^n w_i - \sum_{i=1}^n x_i - v\Big) + \mu\,(f(v) - y). \tag{12.8}$$

When $u_i$ is strictly quasi-concave and differentiable and $f(v)$ is concave and differentiable,
the set of Pareto optimal allocations is characterized by the first order conditions:

$$\frac{\partial L}{\partial x_i} = 0: \quad a_i \frac{\partial u_i}{\partial x_i} = \lambda \tag{12.9}$$

$$\frac{\partial L}{\partial v} = 0: \quad \mu f'(v) = \lambda \tag{12.10}$$

$$\frac{\partial L}{\partial y} = 0: \quad \sum_{i=1}^n a_i \frac{\partial u_i}{\partial y} = \mu. \tag{12.11}$$

So at an interior solution, by (12.9) and (12.10)

$$\frac{a_i}{\mu} = \frac{f'(v)}{\partial u_i/\partial x_i} \tag{12.12}$$

Substituting (12.12) into (12.11),

$$\sum_{i=1}^n \frac{\partial u_i/\partial y}{\partial u_i/\partial x_i} = \frac{1}{f'(v)}. \tag{12.13}$$

Thus, we obtain the well-known Lindahl-Samuelson condition.
In conclusion, the conditions for Pareto efficiency are given by

$$\begin{cases} \sum_{i=1}^n MRS^i_{yx_i} = MRTS_{yv} \\ \sum_{i=1}^n x_i + v = \sum_{i=1}^n w_i \\ y = f(v) \end{cases} \tag{12.14}$$

Example 12.4.1

$u_i = a_i \ln y + \ln x_i$

$y = v$

the Lindahl-Samuelson condition is

$$\sum_{i=1}^n \frac{\partial u_i/\partial y}{\partial u_i/\partial x_i} = 1 \tag{12.15}$$

and thus

$$\sum_{i=1}^n \frac{a_i/y}{1/x_i} = \sum_{i=1}^n \frac{a_i x_i}{y} = 1 \;\Rightarrow\; \sum_{i=1}^n a_i x_i = y \tag{12.16}$$

which implies the level of the public good is not uniquely determined.

Thus, in general, the marginal willingness-to-pay for a public good depends on the amount
of private goods consumption, and therefore, the efficient level of $y$ depends on $x_i$. However,
in the case of quasi-linear utility functions,

$$u_i(x_i,y) = x_i + u_i(y) \tag{12.17}$$

the Lindahl-Samuelson condition becomes

$$\sum_{i=1}^n u_i'(y) = \frac{1}{f'(v)} \equiv c'(y) \tag{12.18}$$

where $c(y) = f^{-1}(y)$ is the cost of producing $y$ in terms of the private good, and thus $y$ is
uniquely determined.

Example 12.4.2

$u_i = a_i \ln y + x_i$

$y = v$

the Lindahl-Samuelson condition is

$$\sum_{i=1}^n \frac{\partial u_i/\partial y}{\partial u_i/\partial x_i} = 1 \tag{12.19}$$

and thus

$$\sum_{i=1}^n \frac{a_i/y}{1} = \sum_{i=1}^n \frac{a_i}{y} = 1 \;\Rightarrow\; \sum_{i=1}^n a_i = y \tag{12.20}$$

which implies the level of the public good is uniquely determined.
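
The contrast between Examples 12.4.1 and 12.4.2 can be seen numerically. In the sketch below, the private good left after producing $y$ is split among consumers in fixed proportions `shares` (summing to one). This parameterization, the numbers, and the function names are assumptions made for the illustration. With $x_i = s_i (W - y)$, condition (12.16) gives $y = W A/(1 + A)$ with $A = \sum_i a_i s_i$, whereas (12.20) gives $y = \sum_i a_i$ regardless of the split.

```python
def efficient_y_log_log(a, shares, W):
    """Example 12.4.1: u_i = a_i ln y + ln x_i with y = v.

    With x_i = shares[i] * (W - y), the condition sum_i a_i x_i = y gives
    y = W * A / (1 + A), where A = sum_i a_i * shares[i].
    """
    A = sum(ai * si for ai, si in zip(a, shares))
    return W * A / (1 + A)

def efficient_y_quasilinear(a):
    """Example 12.4.2: u_i = a_i ln y + x_i with y = v, so sum_i a_i / y = 1."""
    return sum(a)

a = [0.2, 0.8]      # hypothetical preference parameters
W = 10.0            # hypothetical total endowment of the private good

y_favor_1 = efficient_y_log_log(a, [0.9, 0.1], W)   # consumer 1 gets most of x
y_favor_2 = efficient_y_log_log(a, [0.1, 0.9], W)   # consumer 2 gets most of x
y_ql = efficient_y_quasilinear(a)

# Verify (12.16) at the first allocation: sum_i a_i x_i = y.
x_check = [s * (W - y_favor_1) for s in (0.9, 0.1)]
lhs_12_16 = sum(ai * xi for ai, xi in zip(a, x_check))
print(y_favor_1, y_favor_2, y_ql)
```

Shifting private consumption toward the consumer who values the public good more raises the efficient level of $y$ in the first example, while the quasi-linear level does not move.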



12.4.2 Lindahl Equilibrium 

We have given the conditions for Pareto efficiency in the presence of public goods. The
next problem is how to achieve a Pareto efficient allocation in a decentralized way. In
private-goods-only economies, any competitive equilibrium is Pareto optimal. However,
with public goods, a competitive mechanism does not help. For instance, if we tried the
competitive solution with a public good and two consumers, the utility maximization
would equalize the MRS and the relative price, e.g.,

$$MRS^1_{yx} = MRS^2_{yx} = \frac{p_y}{p_x}.$$

This is an immediate violation of the Samuelson-Lindahl optimality condition.

Lindahl suggested using a tax method to provide a public good. Each person is assigned
a specific "personalized price" for the public good. The Lindahl solution is a way to mimic
the competitive solution in the presence of public goods. If we devise a mechanism
to allocate the production cost of a public good among consumers, then we can achieve
the Samuelson-Lindahl condition. To this end, we apply different prices of a public good
to different consumers. The idea of the Lindahl solution is that the consumption level of a
public good is the same for all consumers, but the price of the public good is personalized
among consumers in such a way that the price ratio of the two goods for each person is equal
to that person's marginal rate of substitution between the two goods.

To see this, consider a public goods economy $e$ with $x_i \in \mathbb{R}_+^L$ (private goods) and
$y \in \mathbb{R}_+^K$ (public goods). For simplicity, we assume CRS for $y = f(v)$. A feasible
allocation satisfies

$$\sum_{i=1}^n x_i + v \leq \sum_{i=1}^n w_i.$$

Let $q_i \in \mathbb{R}_+^K$ be the personalized price vector of consumer $i$ for consuming the public
goods.

Let $q = \sum_{i=1}^n q_i$: the market price vector of $y$.
Let $p \in \mathbb{R}_+^L$ be the price vector of private goods.
The profit is defined as $\pi = qy - pv$ with $y = f(v)$.

Definition 12.4.1 (Lindahl Equilibrium) An allocation $(x^*,y^*) \in \mathbb{R}_+^{nL+K}$ is a
Lindahl equilibrium allocation if it is feasible and there exists a price vector $p^* \in \mathbb{R}_+^L$ and
personalized price vectors $q_i^* \in \mathbb{R}_+^K$, one for each individual $i$, such that

(i) $p^* x_i^* + q_i^* y^* \leq p^* w_i$;

(ii) $(x_i,y) \succ_i (x_i^*,y^*)$ implies $p^* x_i + q_i^* y > p^* w_i$;

(iii) $q^* y^* - p^* v^* = 0$,

where $v^* = \sum_{i=1}^n w_i - \sum_{i=1}^n x_i^*$ and $\sum_{i=1}^n q_i^* = q^*$.

We call $(x^*, y^*, p^*, q_1^*, \ldots, q_n^*)$ a Lindahl equilibrium.

Remark 12.4.1 Because of CRS, the maximum profit is zero at the Lindahl equilibrium. 

We may regard a Walrasian equilibrium as a special case of a Lindahl equilibrium
when there are no public goods. In fact, the concept of Lindahl equilibrium in economies
with public goods is, in many ways, a natural generalization of the Walrasian equilibrium
notion in private goods economies, with attention to the well-known duality that reverses
the role of prices and quantities between private and public goods, and between Walrasian
and Lindahl allocations. In the Walrasian case, prices must be equalized while quantities
are individualized; in the Lindahl case the quantities of the public good must be the
same for everyone, while prices charged for public goods are individualized. In addition,
the concepts of Walrasian and Lindahl equilibria are both relevant to private-ownership
economies. Furthermore, they are characterized by purely price-taking behavior on the
part of agents. It is essentially because of this property that one can consider the Lindahl
solution as an informationally decentralized process.

Similarly, we have the following First Fundamental Theorem of Welfare Economics for
public goods economies.

Theorem 12.4.1 Every Lindahl allocation $(x^*,y^*)$ with the price system $(p^*, q_1^*, \ldots, q_n^*)$
is weakly Pareto efficient, and further under local non-satiation, it is Pareto efficient.

Proof: We only prove the second part. The first part is simple. Suppose not. Then there
exists another feasible allocation $(x,y)$ such that $(x_i,y) \succsim_i (x_i^*,y^*)$ for all $i$ and
$(x_j,y) \succ_j (x_j^*,y^*)$ for some $j$. Then, by local non-satiation of $\succsim_i$, we have

$$p^* x_i + q_i^* y \geq p^* w_i \quad \text{for all } i = 1,2,\ldots,n$$
$$p^* x_j + q_j^* y > p^* w_j \quad \text{for some } j.$$

Thus

$$\sum_{i=1}^n p^* x_i + \sum_{i=1}^n q_i^* y > \sum_{i=1}^n p^* w_i$$

So

$$p^* \sum_{i=1}^n x_i + q^* y > \sum_{i=1}^n p^* w_i$$

or

$$p^* \sum_{i=1}^n x_i + p^* v > \sum_{i=1}^n p^* w_i$$

by noting that $q^* y - p^* v \leq q^* y^* - p^* v^* = 0$. Hence,

$$p^* \Big[\sum_{i=1}^n (x_i - w_i) + v\Big] > 0$$

which contradicts the fact that $(x,y)$ is feasible.

For a public goods economy with one private good and one public good, $y = \frac{1}{q} v$, the
definition of Lindahl equilibrium becomes much simpler.

An allocation $(x^*,y^*)$ is a Lindahl Allocation if $(x^*,y^*)$ is feasible (i.e.,
$\sum_{i=1}^n x_i^* + q y^* \leq \sum_{i=1}^n w_i$) and there exists $q_i^*$, $i = 1, \ldots, n$, such that

(i) $x_i^* + q_i y^* \leq w_i$

(ii) $(x_i,y) \succ_i (x_i^*,y^*)$ implies $x_i + q_i y > w_i$

(iii) $\sum_{i=1}^n q_i = q$

In fact, the feasibility condition is automatically satisfied when the budget constraints
(i) are satisfied.

If $(x^*,y^*)$ is an interior Lindahl equilibrium allocation, from the utility maximization,
we can have the first order condition:

$$\frac{\partial u_i/\partial y}{\partial u_i/\partial x_i} = \frac{q_i}{1} \tag{12.23}$$

which means the Lindahl-Samuelson condition holds:

$$\sum_{i=1}^n MRS_{yx_i} = q,$$

which is the necessary condition for Pareto efficiency.
Example 12.4.3

$u_i(x_i,y) = x_i^{a_i} y^{(1-a_i)}$ for $0 < a_i < 1$

$y = \frac{1}{q} v$

The budget constraint is:

$$x_i + q_i y = w_i.$$

The demand functions for $x_i$ and $y_i$ of each $i$ are given by

$$x_i = a_i w_i \tag{12.24}$$

$$y_i = \frac{(1 - a_i) w_i}{q_i} \tag{12.25}$$

Since $y_1 = y_2 = \cdots = y_n = y^*$ at the equilibrium, we have by (12.25)

$$q_i y^* = (1 - a_i) w_i. \tag{12.26}$$

Making summation, we have

$$q y^* = \sum_{i=1}^n (1 - a_i) w_i.$$

Then, we have

$$y^* = \frac{\sum_{i=1}^n (1 - a_i) w_i}{q}$$

and thus, by (12.26), we have

$$q_i = \frac{(1 - a_i) w_i}{y^*} = \frac{q (1 - a_i) w_i}{\sum_{j=1}^n (1 - a_j) w_j}. \tag{12.27}$$

If we want to find a Lindahl equilibrium, we must know the preferences or MRS of each
consumer.

But because of the free-rider problem, it is very difficult to induce consumers to report their
preferences truthfully.
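
Formulas (12.24)-(12.27) can be evaluated directly. The sketch below computes the Lindahl equilibrium of Example 12.4.3 for hypothetical parameter values and checks that the personalized prices add up to $q$, that the allocation exhausts resources, and that each consumer's MRS equals her personalized price. The function name and the numbers are illustrative.

```python
def lindahl_cobb_douglas(a, w, q):
    """Lindahl equilibrium for u_i = x_i^{a_i} y^{1 - a_i} with technology y = v / q.

    Returns private consumptions x, the public good level y_star and the
    personalized prices q_i, following (12.24)-(12.27).
    """
    x = [ai * wi for ai, wi in zip(a, w)]               # (12.24)
    spend = [(1 - ai) * wi for ai, wi in zip(a, w)]     # q_i * y_star, (12.26)
    y_star = sum(spend) / q
    q_i = [s / y_star for s in spend]                   # (12.27)
    return x, y_star, q_i

a = [0.5, 0.75]     # hypothetical preference parameters
w = [4.0, 8.0]      # hypothetical endowments
q = 2.0             # hypothetical unit cost of the public good
x, y_star, q_i = lindahl_cobb_douglas(a, w, q)

# MRS of consumer i between y and x_i for Cobb-Douglas utility.
mrs = [(1 - ai) * xi / (ai * y_star) for ai, xi in zip(a, x)]
resources_used = sum(x) + q * y_star
print(x, y_star, q_i, mrs)
```

Computing the equilibrium requires every $a_i$, which is exactly the private information consumers have an incentive to misreport.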

12.4.3 Free-Rider Problem 

When the MRS is known, a Pareto efficient allocation $(x,y)$ can be determined from the
Lindahl-Samuelson condition or the Lindahl solution. After that, the contribution of each
consumer is given by $g_i = w_i - x_i$. However, it is hard for society to obtain the information
about the MRS. Of course, a naive method is to ask each individual to reveal his
preferences, and thus determine the willingness-to-pay. However, since each consumer is
self-interested, each person wants to be a free-rider and thus is not willing to tell the true
MRS. If consumers realize that shares of the contribution for producing public goods (or
the personalized prices) depend on their answers, they have "incentives to cheat." That
is, when the consumers are asked to report their utility functions or MRSs, they will have
incentives to report a smaller MRS so that they can pay less and still consume the public
good (free riders). This causes the major difficulty in public goods economies.

To see this, notice that the social goal is to reach Pareto efficient allocations for the
public goods economy, but from the personal interest, each person solves the following
problem:

$$\max\; u_i(x_i,y) \tag{12.28}$$

subject to

$$g_i \in [0,w_i]$$
$$x_i + g_i = w_i$$
$$y = f\Big(g_i + \sum_{j \neq i} g_j\Big)$$

That is, each consumer $i$ takes others' strategies $g_{-i}$ as given, and maximizes his payoffs.
From this problem, we can form a non-cooperative game:

$$\Gamma = (G_i, \phi_i)_{i=1}^n$$

where $G_i = [0,w_i]$ is the strategy space of consumer $i$ and $\phi_i: G_1 \times G_2 \times \cdots \times G_n \to \mathbb{R}$ is
the payoff function of $i$ which is defined by

$$\phi_i(g_i, g_{-i}) = u_i\Big[(w_i - g_i),\; f\Big(g_i + \sum_{j \neq i} g_j\Big)\Big] \tag{12.29}$$

Definition 12.4.2 For the game $\Gamma = (G_i, \phi_i)_{i=1}^n$, the strategy $g^* = (g_1^*, \ldots, g_n^*)$ is a Nash
Equilibrium if

$$\phi_i(g_i^*, g_{-i}^*) \geq \phi_i(g_i, g_{-i}^*) \quad \text{for all } g_i \in G_i \text{ and all } i = 1,2,\ldots,n,$$

$g^*$ is a dominant strategy equilibrium if

$$\phi_i(g_i^*, g_{-i}) \geq \phi_i(g_i, g_{-i}) \quad \text{for all } g = (g_i, g_{-i}) \in G_1 \times \cdots \times G_n \text{ and all } i = 1,2,\ldots,n.$$

Remark 12.4.2 Note that the difference between a Nash equilibrium (NE) and a dominant
strategy equilibrium is that at an NE, each consumer chooses his best strategy given the best
strategies of the others, while a dominant strategy is best for a consumer
regardless of others' strategies. Thus, a dominant strategy equilibrium is clearly
a Nash equilibrium, but the converse may not be true. A dominant strategy exists only for
very special payoff functions, while a Nash equilibrium exists for continuous
and quasi-concave payoff functions that are defined on a compact and convex set.

For a Nash equilibrium, if $u_i$ and $f$ are differentiable, then the first order condition is:

$$\frac{\partial \phi_i(g^*)}{\partial g_i} \leq 0 \quad \text{with equality if } g_i > 0, \text{ for all } i = 1,\ldots,n. \tag{12.30}$$

Thus, we have

$$\frac{\partial \phi_i}{\partial g_i} = \frac{\partial u_i}{\partial x_i}(-1) + \frac{\partial u_i}{\partial y}\, f'\Big(g_i^* + \sum_{j \neq i} g_j\Big) \leq 0 \quad \text{with equality if } g_i > 0.$$

So, at an interior solution $g^*$, we have

$$\frac{\partial u_i/\partial y}{\partial u_i/\partial x_i} = \frac{1}{f'\big(g_i^* + \sum_{j \neq i} g_j\big)},$$

and thus

$$MRS^i_{yx_i} = MRTS_{yv},$$

which does not satisfy the Lindahl-Samuelson condition, since that condition requires the
sum of the MRS over all consumers to equal the MRTS. Thus, the Nash equilibrium in
general does not result in Pareto efficient allocations. The above equation implies that
a lower level of the public good is produced than the Pareto efficient level
when utility functions are quasi-concave. Therefore, Nash equilibrium allocations
are in general not consistent with Pareto efficient allocations. How can one solve this
free-rider problem? We will answer this question in the mechanism design theory.
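
Underprovision at the Nash equilibrium can be illustrated with the quasi-linear utilities of Example 12.4.2, $u_i = a_i \ln y + x_i$, and $y = f(v) = v$. The sketch below assumes hypothetical parameter values and endowments large enough never to bind; the function names and the best-response iteration are illustrative. Consumer $i$'s best response to total contributions $G_{-i}$ of the others solves $a_i/(g_i + G_{-i}) = 1$, truncated to $[0, w_i]$.

```python
def best_response(a_i, w_i, others_total):
    """Maximize a_i*ln(g_i + others_total) + (w_i - g_i) over g_i in [0, w_i]."""
    return min(w_i, max(0.0, a_i - others_total))

def nash_contributions(a, w, rounds=100):
    """Iterate best responses in turn until the contribution profile stops changing."""
    g = [0.0] * len(a)
    for _ in range(rounds):
        previous = list(g)
        for i in range(len(a)):
            others_total = sum(g) - g[i]
            g[i] = best_response(a[i], w[i], others_total)
        if g == previous:
            break
    return g

a = [1.0, 2.0, 3.0]      # hypothetical taste parameters
w = [10.0, 10.0, 10.0]   # hypothetical endowments
g_nash = nash_contributions(a, w)
y_nash = sum(g_nash)     # y = f(v) = v
y_efficient = sum(a)     # Lindahl-Samuelson condition: sum_i a_i / y = 1
print(g_nash, y_nash, y_efficient)
```

In this illustration only the consumer with the largest $a_i$ contributes, so the equilibrium level equals $\max_i a_i$ rather than $\sum_i a_i$.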

Part V

Information, Incentives, Mechanism
Design, and Contract Theory

Information economics, incentive theory, mechanism design theory, principal-agent
theory, contract theory, and auction theory have been very important and active research
areas and have had wide applications in various fields in economics, finance, management,
corporate law, and political science in the last five decades. Because of this, about twenty
economists who made founding contributions to mechanism design and the associated fields of
game theory have so far been awarded the Nobel Prize in economics, including Friedrich
Hayek, Ken Arrow, Gerard Debreu, Ronald Coase, Herbert Simon, John Nash, Reinhard
Selten, William Vickrey, James Mirrlees, George Akerlof, Joseph Stiglitz, Michael
Spence, Robert Aumann, Leo Hurwicz, Eric Maskin, Roger Myerson, Peter Diamond,
Oliver Williamson, Al Roth, and Lloyd S. Shapley.

The notion of incentives is a basic and key concept in modern economics. To many 
economists, economics is to a large extent a matter of incentives: incentives to work hard, 
to produce good quality products, to study, to invest, to save, etc. 

Until about 40 years ago, economics was mostly concerned with understanding the
theory of value in large economies. A central question asked in general equilibrium
theory was whether a certain mechanism (especially the competitive mechanism) generated
Pareto-efficient allocations, and if so, for what categories of economic environments. In
a perfectly competitive market, the pressure of competitive markets solves the problem of
incentives for consumers and producers. The major project of understanding how prices
are formed in competitive markets can proceed without worrying about incentives.

The question was then reversed in the economics literature: instead of regarding
mechanisms as given and seeking the class of environments for which they work, one seeks
mechanisms which will implement some desirable outcomes (especially those which result
in Pareto-efficient and individually rational allocations) for a given class of environments
without destroying participants' incentives, and which have a low cost of operation and other
desirable properties. In a sense, the theorists went back to basics.

The reverse question was stimulated by two major lines in the history of economics.
Within the capitalist/private-ownership economics literature, a stimulus arose from
studies focusing upon the failure of the competitive market to function as a mechanism
for implementing efficient allocations in many nonclassical economic environments such
as the presence of externalities, public goods, incomplete information, imperfect
competition, increasing returns to scale, etc. At the beginning of the seventies, works by Akerlof
(1970), Hurwicz (1972), Spence (1974), and Rothschild and Stiglitz (1976) showed in
various ways that asymmetric information was posing a much greater challenge and could
not be satisfactorily embedded in a proper generalization of the Arrow-Debreu theory.

A second stimulus arose from the socialist/state-ownership economics literature, as
evidenced in the "socialist controversy," the debate between Mises-Hayek and
Lange-Lerner in the twenties and thirties of the last century. The controversy was provoked by von
Mises's skepticism as to even a theoretical feasibility of rational allocation under socialism.

The incentive structure and the information structure are thus two basic features of any
economic system. The study of these two features grew out of these two major lines,
culminating in the theory of mechanism design. The theory of economic mechanism
design, which was originated by Hurwicz, is very general. All economic mechanisms and
systems (including those known and unknown, private-ownership, state-ownership, and
mixed-ownership systems) can be studied with this theory.

At the micro level, the development of the theory of incentives has also been a major
advance in economics in the last forty years. Before that, by treating the firm as a black box, the
theory remained silent on how the owners of firms succeed in aligning the objectives of their
various members, such as workers, supervisors, and managers, with profit maximization.

When economists began to look more carefully at the firm, either in agricultural or 
managerial economics, incentives became the central focus of their analysis. Indeed, dele- 
gation of a task to an agent who has different objectives than the principal who delegates 
this task is problematic when information about the agent is imperfect. This problem is 
the essence of incentive questions. Thus, conflicting objectives and decentralized 
information are the two basic ingredients of incentive theory. 

We will discover that, in general, these informational problems prevent society from
achieving the first-best allocation of resources that would be possible in a world where all
information was common knowledge. The additional costs that must be incurred
because of the strategic behavior of privately informed economic agents can be viewed
as one category of transaction costs. Although they do not exhaust all possible
transaction costs, economists have been rather successful during the last thirty years in
modeling and analyzing these types of costs and providing a good understanding of the
limits they set on the allocation of resources. This line of research also provides
a whole set of insights on how to begin to take into account agents' responses to the
incentives provided by institutions.

The three words contracts, mechanisms, and institutions are to a large extent
synonymous. They all mean "rules of the game," which describe what actions the parties
can undertake and what outcomes these actions yield. In most cases the
rules of the game are designed by someone, as in chess, basketball, etc. The rules are
designed to achieve better outcomes. But there is one difference. While mechanism design
theory may be able to answer "big" questions, such as "socialism vs. capitalism," contract
theory is developed and useful for more manageable smaller questions, concerning specific
contracting practices and mechanisms.

Thus, mechanism design is normative economics, in contrast to game theory, which 
is positive economics. Game theory is important because it predicts how a given game 
will be played by agents. Mechanism design goes one step further: given the physical 
environment and the constraints faced by the designer, what goal can be realized or 
implemented? What mechanisms are optimal among those that are feasible? In designing 
mechanisms one must take into account incentive constraints (e.g., consumers may not 
report truthfully how many pairs of shoes they need or how productive they are). 

This part considers the design of contracts in which one or many parties have private 
characteristics or hidden actions. The party who designs the contract will be called the 
Principal, while the other parties will be called agents. For the most part we will focus 
on the situation where the Principal has no private information and the agents do. This 
framework is called screening, because the principal will in general try to screen different 
types of agents by inducing them to choose different bundles. The opposite situation, in 
which the Principal has private information and the agents do not, is called signaling, 
since the Principal could signal his type with the design of his contract. 

We will briefly present contract theory in four chapters. Chapters 13 and 14
consider the principal-agent model where the principal delegates an action to a single agent
with private information. This private information can be of two types: either the agent
can take an action unobserved by the principal, the case of moral hazard or hidden action;
or the agent has some private knowledge about his cost or valuation that is unknown to
the principal, the case of adverse selection or hidden knowledge. The theory of optimal
mechanism design considers when this private information is a problem for the principal,
and what is the optimal way for the principal to cope with it. The design of the
principal's optimal contract can be regarded as a simple optimization problem.
This simple focus will turn out to be enough to highlight the various
trade-offs between allocative efficiency and distribution of information rents arising
under incomplete information. The mere existence of informational constraints may
generally prevent the principal from achieving allocative efficiency. We will
characterize the allocative distortions that the principal finds desirable to implement in order to
mitigate the impact of informational constraints.

Chapter 15 will consider situations with one principal and many agents. Asymmetric 
information may not only affect the relationship between the principal and each of his 
agents, but it may also plague the relationships between agents. Moreover, maintaining 
the hypothesis that agents adopt an individualistic behavior, those organizational contexts 
require a solution concept of equilibrium, which describes the strategic interaction between 
agents under complete or incomplete information. 

Chapter 16 will briefly study dynamic contract theory. We will discuss long-term
incentive contracting in a dynamic principal-agent setting with one agent and adverse
selection. We will first consider the case where the principal (designer) can commit to
a contract forever, and then consider what happens when she cannot commit against
modifying the contract as new information arrives.

It is gratefully acknowledged that the materials are largely drawn from the references at
the end of each chapter, especially from Laffont and Martimort (2002) for Chapters 13 and
14 and from Segal (2007).
Chapter 13

Optimal Mechanism Design:
Contracts with One-Agent and
Hidden Information

13.1 Introduction 

Optimal contract theory with two parties is to a large extent also called principal-agent
theory. One party is a principal and the other is an agent.

Incentive problems arise when a principal wants to delegate a task to an agent with 
private information. The exact opportunity cost of this task, the precise technology used, 
and how good the matching is between the agent's intrinsic ability and this technology 
are all examples of pieces of information that may become private knowledge of the agent. 
In such cases, we will say that there is adverse selection. 

Examples 

1. The landlord delegates the cultivation of his land to a tenant, who will be the only 
one to observe the exact local weather conditions. 

2. A client delegates his defense to an attorney who will be the only one to know the 
difficulty of the case. 

3. An investor delegates the management of his portfolio to a broker, who will privately 
know the prospects of the possible investments. 

4. A stockholder delegates the firm's day-to-day decisions to a manager, who will be
the only one to know the business conditions.

5. An insurance company provides insurance to agents who privately know how good
they are as drivers.

6. The Department of Defense procures a good from the military industry without 
knowing its exact cost structure. 

7. A regulatory agency contracts for service with a public utility company without 
having complete information about its technology. 

The common aspect of all those contracting settings is that the information gap
between the principal and the agent has some fundamental implications for the design of the
contract they sign. In order to reach an efficient use of economic resources, some
information rent must be given up to the privately informed agent. At the optimal second-best
contract, the principal trades off his desire to reach allocative efficiency against the costly
information rent given up to the agent to induce information revelation. Implicit here is
the idea that there exists a legal framework for this contractual relationship. The contract
can be enforced by a benevolent court of law, and the agent is bound by the terms of the
contract.

The main objective of this chapter is to characterize the optimal rent
extraction-efficiency trade-off faced by the principal when designing his contractual offer to the agent
under the set of incentive feasible constraints: incentive and participation constraints. In
general, incentive constraints are binding at the optimum, showing that adverse selection
clearly impedes the efficiency of trade. The main lesson of this optimization is that
the optimal second-best contract calls for a distortion in the volume of trade
away from the first-best and for giving up some strictly positive information
rents to the most efficient agents.
13.2 Basic Settings of Principal-Agent Model with
Adverse Selection

13.2.1 Economic Environment (Technology, Preferences, and Information)

Consider a consumer or a firm (the principal) who wants to delegate to an agent the
production of $q$ units of a good. The value for the principal of these $q$ units is $S(q)$, where
$S' > 0$, $S'' < 0$ and $S(0) = 0$.

The production cost of the agent is unobservable to the principal, but it is common
knowledge that the fixed cost is $F$ and the marginal cost belongs to the set $\Theta = \{\underline{\theta}, \bar{\theta}\}$.
The agent can be either efficient ($\underline{\theta}$) or inefficient ($\bar{\theta}$) with respective probabilities $\nu$ and
$1 - \nu$. That is, he has the cost function

$$C(q, \underline{\theta}) = \underline{\theta} q + F \quad \text{with probability } \nu \tag{13.1}$$

or

$$C(q, \bar{\theta}) = \bar{\theta} q + F \quad \text{with probability } 1 - \nu. \tag{13.2}$$

Denote by $\Delta\theta = \bar{\theta} - \underline{\theta} > 0$ the spread of uncertainty on the agent's marginal cost. This
information structure is exogenously given to the players.

13.2.2 Contracting Variables: Outcomes 

The contracting variables are the quantity produced q and the transfer t received by the 
agent. Let A be the set of feasible allocations that is given by 

$$A = \{(q, t) : q \in \mathbb{R}_+, \; t \in \mathbb{R}\}. \tag{13.3}$$

These variables are both observable and verifiable by a third party such as a benevolent 
court of law. 

13.2.3 Timing 

Unless explicitly stated, we will maintain the timing defined in the figure below, where A 
denotes the agent and P the principal. 
Figure 13.1: Timing of contracting under hidden information.

Note that contracts are offered at the interim stage; there is already asymmetric 
information between the contracting parties when the principal makes his offer. 

13.3 The Complete Information Optimal Contract (Benchmark
Case)

13.3.1 First-Best Production Levels 

To get a reference system for comparison, let us first suppose that there is no asymmetry
of information between the principal and the agent. The efficient production levels are
obtained by equating the principal's marginal value and the agent's marginal cost. Hence,
we have the following first-order conditions

$$S'(\underline{q}^*) = \underline{\theta} \tag{13.4}$$

and

$$S'(\bar{q}^*) = \bar{\theta}. \tag{13.5}$$

The complete information efficient production levels $\underline{q}^*$ and $\bar{q}^*$ should both be carried out
if their social values, respectively $\underline{W}^* = S(\underline{q}^*) - \underline{\theta}\,\underline{q}^* - F$ and $\bar{W}^* = S(\bar{q}^*) - \bar{\theta}\,\bar{q}^* - F$, are
non-negative.
Since

$$S(\underline{q}^*) - \underline{\theta}\,\underline{q}^* \geq S(\bar{q}^*) - \underline{\theta}\,\bar{q}^* \geq S(\bar{q}^*) - \bar{\theta}\,\bar{q}^*$$

by definition of $\underline{q}^*$ and $\bar{\theta} > \underline{\theta}$, the social value of production when the agent is efficient,
$\underline{W}^*$, is greater than when he is inefficient, namely $\bar{W}^*$.

For trade to be always carried out, it is thus enough that production be socially
valuable for the least efficient type, i.e., the following condition must be satisfied

$$\bar{W}^* = S(\bar{q}^*) - \bar{\theta}\,\bar{q}^* - F \geq 0. \tag{13.6}$$

As the fixed cost $F$ plays no role other than justifying the existence of a single agent, it
is set to zero from now on in order to simplify notation.

Note that, since the principal's marginal value of output is decreasing, the optimal
production of an efficient agent is greater than that of an inefficient agent, i.e., $\underline{q}^* > \bar{q}^*$.
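
As a numerical illustration of conditions (13.4) to (13.6), the following sketch assumes a hypothetical value function S(q) = 2*sqrt(q) (so that S' > 0, S'' < 0, and S(0) = 0), marginal costs of 1 and 2 for the efficient and inefficient types, and F = 0. These primitives are chosen only for illustration and do not come from the text.

```python
import math

# Hypothetical primitives chosen for illustration only (not from the text):
# S(q) = 2*sqrt(q), so S'(q) = 1/sqrt(q); marginal costs 1 (efficient) and 2.
THETA_LOW, THETA_HIGH = 1.0, 2.0


def value(q):
    """Principal's value S(q) = 2*sqrt(q)."""
    return 2.0 * math.sqrt(q)


def first_best_output(theta):
    """Solve S'(q) = theta, i.e. 1/sqrt(q) = theta, for q."""
    return 1.0 / theta ** 2


def social_value(q, theta, fixed_cost=0.0):
    """Social value W = S(q) - theta*q - F."""
    return value(q) - theta * q - fixed_cost


q_low_star = first_best_output(THETA_LOW)    # efficient type
q_high_star = first_best_output(THETA_HIGH)  # inefficient type
W_low_star = social_value(q_low_star, THETA_LOW)
W_high_star = social_value(q_high_star, THETA_HIGH)

print("first-best outputs:", q_low_star, q_high_star)
print("social values:", W_low_star, W_high_star)
```

In this example the efficient type produces more and generates a larger social value, and the social value for the inefficient type is non-negative, so trade is carried out with both types.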

13.3.2 Implementation of the First-Best 

For a successful delegation of the task, the principal must offer the agent a utility level
that is at least as high as the utility level that the agent obtains outside the relationship.
We refer to these constraints as the agent's participation constraints. If we normalize to
zero the agent's outside opportunity utility level (sometimes called his status quo utility level),
these participation constraints are written as

$$\underline{t} - \underline{\theta}\,\underline{q} \geq 0, \tag{13.7}$$

$$\bar{t} - \bar{\theta}\,\bar{q} \geq 0. \tag{13.8}$$

To implement the first-best production levels, the principal can make the following
take-it-or-leave-it offers to the agent: If $\theta = \bar{\theta}$ (resp. $\underline{\theta}$), the principal offers the transfer
$\bar{t}^*$ (resp. $\underline{t}^*$) for the production level $\bar{q}^*$ (resp. $\underline{q}^*$) with $\bar{t}^* = \bar{\theta}\,\bar{q}^*$ (resp. $\underline{t}^* = \underline{\theta}\,\underline{q}^*$). Thus,
whatever his type, the agent accepts the offer and makes zero profit. The complete
information optimal contracts are thus $(\underline{t}^*, \underline{q}^*)$ if $\theta = \underline{\theta}$ and $(\bar{t}^*, \bar{q}^*)$ if $\theta = \bar{\theta}$. Importantly,
under complete information delegation is costless for the principal, who achieves the same
utility level that he would get if he were carrying out the task himself (with the same cost
function as the agent).
Figure 13.2: Indifference curves of both types.

13.3.3 A Graphical Representation of the Complete Information 
Optimal Contract 

Since $\bar{\theta} > \underline{\theta}$, the iso-utility curves for different types cross only once as shown in the above
figure. This important property is called the single-crossing or Spence-Mirrlees property.

The complete information optimal contract is finally represented in Figure 13.3 by the
pair of points $(A^*, B^*)$. Note that since the iso-utility curves of the principal correspond
to increasing levels of utility when one moves in the southeast direction, the principal
reaches a higher profit when dealing with the efficient type. We denote by $\bar{V}^*$ (resp.
$\underline{V}^*$) the principal's level of utility when he faces the $\bar{\theta}$- (resp. $\underline{\theta}$-) type. Because the
principal has all the bargaining power in designing the contract, we have $\bar{V}^* = \bar{W}^*$ (resp.
$\underline{V}^* = \underline{W}^*$) under complete information.
Figure 13.3: First best contracts.

13.4 Incentive Feasible Contracts 

13.4.1 Incentive Compatibility and Participation 

Suppose now that the marginal cost $\theta$ is the agent's private information and let us consider
the case where the principal offers the menu of contracts $\{(\underline{t}^*, \underline{q}^*); (\bar{t}^*, \bar{q}^*)\}$ hoping that an
agent with type $\underline{\theta}$ will select $(\underline{t}^*, \underline{q}^*)$ and an agent with type $\bar{\theta}$ will select instead $(\bar{t}^*, \bar{q}^*)$.

From Figure 13.3 above, we see that $B^*$ is preferred to $A^*$ by both types of agents.
Offering the menu $(A^*, B^*)$ fails to have the agents self-select properly within this menu.
The efficient type has an incentive to mimic the inefficient one and also selects contract
$B^*$. The complete information optimal contracts can no longer be implemented under
asymmetric information. We will thus say that the menu of contracts $\{(\underline{t}^*, \underline{q}^*); (\bar{t}^*, \bar{q}^*)\}$ is
not incentive compatible.

Definition 13.4.1 A menu of contracts $\{(\underline{t}, \underline{q}); (\bar{t}, \bar{q})\}$ is incentive compatible when $(\underline{t}, \underline{q})$
is weakly preferred to $(\bar{t}, \bar{q})$ by agent $\underline{\theta}$ and $(\bar{t}, \bar{q})$ is weakly preferred to $(\underline{t}, \underline{q})$ by agent $\bar{\theta}$.

Mathematically, these requirements amount to the fact that the allocations must satisfy
the following incentive compatibility constraints:

$$\underline{t} - \underline{\theta}\,\underline{q} \geq \bar{t} - \underline{\theta}\,\bar{q} \tag{13.9}$$

and

$$\bar{t} - \bar{\theta}\,\bar{q} \geq \underline{t} - \bar{\theta}\,\underline{q}. \tag{13.10}$$

Furthermore, for a menu to be accepted, it must satisfy the following two participation
constraints:

$$\underline{t} - \underline{\theta}\,\underline{q} \geq 0, \tag{13.11}$$

$$\bar{t} - \bar{\theta}\,\bar{q} \geq 0. \tag{13.12}$$

Definition 13.4.2 A menu of contracts is incentive feasible if it satisfies both incentive 
and participation constraints (13.9) through (13.12). 

The inequalities (13.9) through (13.12) fully characterize the set of incentive feasible 
menus of contracts. The restrictions embodied in this set express additional constraints 
imposed on the allocation of resources by asymmetric information between the principal 
and the agent. 
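
The four constraints can be checked mechanically for any candidate menu. The sketch below is a minimal illustration, not part of the original text: the function names are hypothetical, and the numbers assume marginal costs 1 and 2 and a first-best menu in which each transfer just covers the intended type's cost.

```python
def agent_payoff(t, q, theta):
    """Agent's utility t - theta*q from contract (t, q) at marginal cost theta."""
    return t - theta * q


def check_menu(low_contract, high_contract, theta_low, theta_high, tol=1e-12):
    """Evaluate constraints (13.9)-(13.12) for a menu of two contracts.

    low_contract is the (t, q) pair intended for the efficient type and
    high_contract the pair intended for the inefficient type.
    """
    t_l, q_l = low_contract
    t_h, q_h = high_contract
    u_low = agent_payoff(t_l, q_l, theta_low)
    u_high = agent_payoff(t_h, q_h, theta_high)
    return {
        "IC_efficient": u_low >= agent_payoff(t_h, q_h, theta_low) - tol,
        "IC_inefficient": u_high >= agent_payoff(t_l, q_l, theta_high) - tol,
        "IR_efficient": u_low >= -tol,
        "IR_inefficient": u_high >= -tol,
    }


def is_incentive_feasible(low_contract, high_contract, theta_low, theta_high):
    """True when all incentive and participation constraints hold."""
    return all(check_menu(low_contract, high_contract, theta_low, theta_high).values())


# Hypothetical first-best menu: each transfer equals the intended type's cost.
first_best = check_menu((1.0, 1.0), (0.5, 0.25), theta_low=1.0, theta_high=2.0)
# Same outputs, but the efficient type's transfer is raised by 0.25.
adjusted_ok = is_incentive_feasible((1.25, 1.0), (0.5, 0.25), 1.0, 2.0)
print(first_best)
print(adjusted_ok)
```

In this example only the efficient type's incentive constraint fails for the first-best menu, and raising his transfer by 0.25 restores incentive feasibility.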

13.4.2 Special Cases 

Bunching or Pooling Contracts: A first special case of incentive feasible menu of
contracts is obtained when the contracts targeted for each type coincide, i.e., when
$\underline{t} = \bar{t} = t^p$, $\underline{q} = \bar{q} = q^p$, and both types of agent accept this contract.

Shutdown of the Least Efficient Type: Another particular case occurs when one
of the contracts is the null contract $(0, 0)$ and the nonzero contract $(t^s, q^s)$ is only accepted
by the efficient type. Then, (13.9) and (13.11) both reduce to

$$t^s - \underline{\theta}\,q^s \geq 0. \tag{13.13}$$

The incentive constraint of the bad type reduces to

$$0 \geq t^s - \bar{\theta}\,q^s. \tag{13.14}$$

As with the pooling contract, the benefit of the $(0, 0)$ option is that it somewhat
reduces the number of constraints since the incentive and participation constraints take
the same form. The cost of such a contract may be an excessive screening of types. Here,
the screening of types takes the rather extreme form of excluding the least efficient type.

13.4.3 Monotonicity Constraints 

Incentive compatibility constraints reduce the set of feasible allocations. Moreover, these
quantities must generally satisfy a monotonicity constraint which does not exist under
complete information. Adding (13.9) and (13.10), we immediately have

$$\underline{q} \geq \bar{q}. \tag{13.15}$$

We will call condition (13.15) an implementability condition that is necessary and
sufficient for implementability.

Indeed, suppose that (13.15) holds; it is clear that there exist transfers $\underline{t}$ and $\bar{t}$ such
that the incentive constraints (13.9) and (13.10) both hold. It is enough to take those
transfers such that

$$\underline{\theta}(\underline{q} - \bar{q}) \leq \underline{t} - \bar{t} \leq \bar{\theta}(\underline{q} - \bar{q}). \tag{13.16}$$

Remark 13.4.1 In this two-type model, the conditions for implementability take a
simple form. With more than two types (or with a continuum), the characterization of these
conditions might get harder. The conditions for implementability are also more difficult
to characterize when the agent performs several tasks on behalf of the principal.

13.5 Information Rents

To understand the structure of the optimal contract it is useful to introduce the concept
of information rent.

We know from the previous discussion that, under complete information, the principal is able
to maintain all types of agents at their zero status quo utility level. Their respective
utility levels $\underline{U}^*$ and $\bar{U}^*$ at the first-best satisfy

$$\underline{U}^* = \underline{t}^* - \underline{\theta}\,\underline{q}^* = 0 \tag{13.17}$$

and

$$\bar{U}^* = \bar{t}^* - \bar{\theta}\,\bar{q}^* = 0. \tag{13.18}$$

Generally this will not be possible anymore under incomplete information, at least when
the principal wants both types of agents to be active.

We use the notations $\underline{U} = \underline{t} - \underline{\theta}\,\underline{q}$ and $\bar{U} = \bar{t} - \bar{\theta}\,\bar{q}$ to denote the respective information
rent of each type.

Take any menu $\{(\bar{t}, \bar{q}); (\underline{t}, \underline{q})\}$ of incentive feasible contracts. How much would a $\underline{\theta}$-agent
get by mimicking a $\bar{\theta}$-agent? The high-efficient agent would get

$$\bar{t} - \underline{\theta}\,\bar{q} = \bar{t} - \bar{\theta}\,\bar{q} + \Delta\theta\,\bar{q} = \bar{U} + \Delta\theta\,\bar{q}. \tag{13.19}$$

Thus, as long as the principal insists on a positive output for the inefficient type, $\bar{q} > 0$,
the principal must give up a positive rent to a $\underline{\theta}$-agent. This information rent is generated
by the informational advantage of the agent over the principal.
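
The decomposition in (13.19) can be verified with a small computation. The numbers below are hypothetical (marginal costs 1 and 2, and an inefficient-type contract with transfer 0.5 and output 0.25 that leaves that type zero rent); they are used only to show that the payoff from mimicking equals the inefficient type's rent plus the cost advantage on his output.

```python
def information_rent(t, q, theta):
    """Rent U = t - theta*q earned by a type-theta agent on contract (t, q)."""
    return t - theta * q


def mimicking_payoff(t_high, q_high, theta_low):
    """Payoff of the efficient type when he picks the inefficient type's contract."""
    return t_high - theta_low * q_high


# Hypothetical numbers for illustration only.
theta_low, theta_high = 1.0, 2.0
t_high, q_high = 0.5, 0.25

U_high = information_rent(t_high, q_high, theta_high)
gain = mimicking_payoff(t_high, q_high, theta_low)
decomposition = U_high + (theta_high - theta_low) * q_high

print(U_high, gain, decomposition)
```

Here the inefficient type earns no rent, yet the efficient type would earn 0.25 by mimicking him, which is exactly the cost spread times the inefficient type's output.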



13.6 The Optimization Program of the Principal 

According to the timing of the contractual game, the principal must offer a menu of
contracts before knowing which type of agent he is facing. Then, the principal's problem
can be written as

$$\max_{\{(\bar{t}, \bar{q}); (\underline{t}, \underline{q})\}} \; \nu\,(S(\underline{q}) - \underline{t}) + (1 - \nu)(S(\bar{q}) - \bar{t})$$

subject to (13.9) to (13.12).

Using the definition of the information rents $\underline{U} = \underline{t} - \underline{\theta}\,\underline{q}$ and $\bar{U} = \bar{t} - \bar{\theta}\,\bar{q}$, we can replace
transfers in the principal's objective function as functions of information rents and outputs
so that the new optimization variables are now $\{(\underline{U}, \underline{q}); (\bar{U}, \bar{q})\}$. The focus on information
rents enables us to assess the distributive impact of asymmetric information, and the
focus on outputs allows us to analyze its impact on allocative efficiency and the overall
gains from trade. Thus an allocation corresponds to a volume of trade and a distribution
of the gains from trade between the principal and the agent.

With this change of variables, the principal's objective function can then be rewritten
as

$$\underbrace{\nu\,(S(\underline{q}) - \underline{\theta}\,\underline{q}) + (1 - \nu)(S(\bar{q}) - \bar{\theta}\,\bar{q})}_{\text{expected allocative efficiency}} - \underbrace{(\nu\,\underline{U} + (1 - \nu)\,\bar{U})}_{\text{expected information rent}}. \tag{13.20}$$

The first term denotes expected allocative efficiency, and the second term denotes expected
information rent, which implies that the principal is ready to accept some distortions away
from efficiency in order to decrease the agent's information rent.

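The incentive constraints (13.9) and (13.10), written in terms of information rents and
outputs, become respectively

$$\underline{U} \geq \bar{U} + \Delta\theta\,\bar{q}, \tag{13.21}$$

$$\bar{U} \geq \underline{U} - \Delta\theta\,\underline{q}. \tag{13.22}$$

The participation constraints (13.11) and (13.12) become respectively

$$\underline{U} \geq 0, \tag{13.23}$$

$$\bar{U} \geq 0. \tag{13.24}$$

The principal wishes to solve problem (P) below:

$$\max_{\{(\underline{U}, \underline{q}); (\bar{U}, \bar{q})\}} \; \nu\,(S(\underline{q}) - \underline{\theta}\,\underline{q}) + (1 - \nu)(S(\bar{q}) - \bar{\theta}\,\bar{q}) - (\nu\,\underline{U} + (1 - \nu)\,\bar{U})$$

subject to (13.21) to (13.24).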

We index the solution to this problem with a superscript SB, meaning second-best. 

13.7 The Rent Extraction-Efficiency Trade-Off

13.7.1 The Optimal Contract Under Asymmetric Information 

The major technical difficulty of problem (P) is to determine which of the many
constraints imposed by incentive compatibility and participation are the relevant ones, i.e.,
the binding ones at the optimum of the principal's problem.

Let us first consider contracts without shutdown, i.e., such that $\bar{q} > 0$. This is true
when the so-called Inada condition $S'(0) = +\infty$ is satisfied and $\lim_{q \to 0} S'(q)\,q = 0$.

Note that the $\underline{\theta}$-agent's participation constraint (13.23) is then always strictly satisfied.
Indeed, (13.24) and (13.21) immediately imply (13.23). (13.22) also seems irrelevant
because the difficulty comes from a $\underline{\theta}$-agent willing to claim that he is inefficient rather
than the reverse.

This simplification in the number of relevant constraints leaves us with only two
remaining constraints, the $\underline{\theta}$-agent's incentive compatibility constraint (13.21) and the
$\bar{\theta}$-agent's participation constraint (13.24), and both constraints must be binding at the
optimum of the principal's problem (P):

$$\underline{U} = \Delta\theta\,\bar{q} \tag{13.25}$$

and

$$\bar{U} = 0. \tag{13.26}$$

Substituting (13.25) and (13.26) into the principal's objective function, we obtain a
reduced program (P') with outputs as the only choice variables:

$$\max_{\{(\underline{q}, \bar{q})\}} \; \nu\,(S(\underline{q}) - \underline{\theta}\,\underline{q}) + (1 - \nu)(S(\bar{q}) - \bar{\theta}\,\bar{q}) - \nu\,\Delta\theta\,\bar{q}.$$

Compared with the full information setting, asymmetric information alters the principal's
optimization simply by the subtraction of the expected rent that has to be given up to the
efficient type. The inefficient type gets no rent, but the efficient type $\underline{\theta}$ gets the information
rent that he could obtain by mimicking the inefficient type $\bar{\theta}$. This rent depends only on
the level of production requested from this inefficient type.
The first order conditions are then given by

$$S'(\underline{q}^{SB}) = \underline{\theta} \quad \text{or} \quad \underline{q}^{SB} = \underline{q}^* \tag{13.27}$$

and

$$(1 - \nu)\,(S'(\bar{q}^{SB}) - \bar{\theta}) = \nu\,\Delta\theta. \tag{13.28}$$

(13.28) expresses the important trade-off between efficiency and rent extraction which
arises under asymmetric information.

To validate our approach based on the sole consideration of the efficient type's incentive
compatibility constraint, it is necessary to check that the omitted incentive compatibility
constraint of an inefficient agent is satisfied, i.e., $0 \geq \Delta\theta\,\bar{q}^{SB} - \Delta\theta\,\underline{q}^{SB}$. This latter
inequality follows from the monotonicity of the second-best schedule of outputs since we
have $\underline{q}^{SB} = \underline{q}^* > \bar{q}^* > \bar{q}^{SB}$.

In summary, we have the following proposition. 

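Proposition 13.7.1 Under asymmetric information, the optimal contracts entail:

(1) No output distortion for the efficient type with respect to the first-best, $\underline{q}^{SB} = \underline{q}^*$.
A downward output distortion for the inefficient type, $\bar{q}^{SB} < \bar{q}^*$ with

$$S'(\bar{q}^{SB}) = \bar{\theta} + \frac{\nu}{1 - \nu}\,\Delta\theta. \tag{13.29}$$

(2) Only the efficient type gets a positive information rent given by

$$\underline{U}^{SB} = \Delta\theta\,\bar{q}^{SB}. \tag{13.30}$$

(3) The second-best transfers are respectively given by $\underline{t}^{SB} = \underline{\theta}\,\underline{q}^* + \Delta\theta\,\bar{q}^{SB}$ and
$\bar{t}^{SB} = \bar{\theta}\,\bar{q}^{SB}$.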
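
To see the proposition at work, the sketch below computes the second-best menu for a hypothetical specification that is not taken from the text: S(q) = 2*sqrt(q), marginal costs 1 and 2, and probability 0.5 of the efficient type. It also compares the principal's expected payoff with the alternative of keeping the first-best outputs and paying the rent needed to make them incentive compatible.

```python
import math


def second_best_contract(theta_low, theta_high, nu):
    """Second-best menu for the assumed value function S(q) = 2*sqrt(q).

    With S'(q) = 1/sqrt(q), condition (13.27) gives q_low = 1/theta_low**2 and
    condition (13.29) gives q_high = 1/virtual_cost**2.
    """
    delta = theta_high - theta_low
    virtual_cost = theta_high + nu / (1.0 - nu) * delta
    q_low = 1.0 / theta_low ** 2
    q_high = 1.0 / virtual_cost ** 2
    rent_low = delta * q_high            # (13.30)
    return {
        "q_low": q_low,
        "q_high": q_high,
        "rent_low": rent_low,
        "t_low": theta_low * q_low + rent_low,
        "t_high": theta_high * q_high,
    }


def principal_payoff(q_low, t_low, q_high, t_high, nu):
    """Expected payoff nu*(S(q_low) - t_low) + (1 - nu)*(S(q_high) - t_high)."""
    return (nu * (2.0 * math.sqrt(q_low) - t_low)
            + (1.0 - nu) * (2.0 * math.sqrt(q_high) - t_high))


# Hypothetical parameters for illustration only.
nu = 0.5
sb = second_best_contract(theta_low=1.0, theta_high=2.0, nu=nu)
payoff_sb = principal_payoff(sb["q_low"], sb["t_low"], sb["q_high"], sb["t_high"], nu)

# Alternative: first-best outputs (1 and 0.25) with the rent 1 * 0.25 paid
# to the efficient type so that he does not mimic the inefficient type.
payoff_fb_outputs = principal_payoff(1.0, 1.25, 0.25, 0.5, nu)

print(sb)
print(payoff_sb, payoff_fb_outputs)
```

Under these assumptions the inefficient type's output falls from 0.25 to 1/9, and the principal's expected payoff is higher than with undistorted outputs, which illustrates why he accepts the downward distortion.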

Remark 13.7.1 The basic idea and insights of principal-agent theory were to some extent
already present in "The Art of War," written by Sun Tzu in the 6th century BC. This
ancient Chinese military treatise was China's earliest and most outstanding
and complete work on warcraft. Dubbed "the Bible of Military Science," it is also the
earliest work on military strategies in the world. Sun Tzu stressed the critical importance
of information by saying: "If you know the enemy and know yourself, you need
not fear the result of a hundred battles. If you know yourself but not the enemy, for
every victory gained you will also suffer a defeat. If you know neither the enemy nor
yourself, you will succumb in every battle." That is, the best you can do is the first best
when information is complete, and the best you can do is the second best when information
is asymmetric. It is worse still if you know nothing about either the others or yourself.
13.7.2 A Graphical Representation of the Second-Best Outcome

Figure 13.4: Rent needed to implement the first best outputs.

Starting from the complete information optimal contract $(A^*, B^*)$ that is not incentive
compatible, we can construct an incentive compatible contract $(B^*, C)$ with the same
production levels by giving a higher transfer to the agent producing $\underline{q}^*$ as shown in the
figure above. The contract $C$ is on the $\underline{\theta}$-agent's indifference curve passing through $B^*$.
Hence, the $\underline{\theta}$-agent is now indifferent between $B^*$ and $C$. $(B^*, C)$ becomes an
incentive-compatible menu of contracts. The rent that is given up to the $\underline{\theta}$-firm is now $\Delta\theta\,\bar{q}^*$. This
contract is not optimal by the first order conditions (13.27) and (13.28). The optimal
trade-off finally occurs at $(A^{SB}, B^{SB})$ as shown in the figure below.

13.7.3 Shutdown Policy 

Figure 13.5: Optimal second-best contract $A^{SB}$ and $B^{SB}$.

If the first-order condition in (13.29) has no positive solution, $\bar{q}^{SB}$ should be set at zero.
We are in the special case of a contract with shutdown. $B^{SB}$ coincides with the origin $0$ and $A^{SB}$
with $A^*$ in the figure above. No rent is given up to the $\underline{\theta}$-firm by the unique non-null
contract $(\underline{t}^*, \underline{q}^*)$ offered and selected only by agent $\underline{\theta}$. The benefit of such a policy is that
no rent is given up to the efficient type.

Remark 13.7.2 The shutdown policy is dependent on the status quo utility levels.
Suppose that, for both types, the status quo utility level is $U_0 > 0$. Then, from the principal's
objective function, shutdown is optimal whenever

$$\frac{\nu}{1 - \nu}\,\Delta\theta\,\bar{q}^{SB} + U_0 \geq S(\bar{q}^{SB}) - \bar{\theta}\,\bar{q}^{SB}. \tag{13.31}$$

Thus, for $\nu$ large enough, shutdown occurs even if the Inada condition $S'(0) = +\infty$ is
satisfied. Note that this case also occurs when the agent has a strictly positive fixed cost
$F > 0$ (to see that, just set $U_0 = F$).

The occurrence of shutdown can also be interpreted as saying that the principal has
another choice variable to solve the screening problem. This extra variable is the subset of
types which are induced to produce a positive amount. Reducing the subset of producing
agents obviously reduces the rent of the most efficient type.
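
A simple way to see when (13.29) has no positive solution is to use a value function with finite marginal value at zero. The sketch below assumes, purely for illustration, S(q) = a*q - q**2/2 on [0, a] (so S'(q) = a - q and S'(0) = a is finite), marginal costs 1 and 2, and zero status quo utility; none of these choices come from the text.

```python
def second_best_high_output(a, theta_low, theta_high, nu):
    """Inefficient type's second-best output for the assumed S(q) = a*q - q**2/2.

    Condition (13.29) reads a - q = theta_high + nu/(1 - nu) * delta_theta.
    When its solution is not positive, the output is set to zero (shutdown).
    """
    virtual_cost = theta_high + nu / (1.0 - nu) * (theta_high - theta_low)
    return max(0.0, a - virtual_cost)


# Hypothetical parameters for illustration only.
a, theta_low, theta_high = 3.0, 1.0, 2.0
outputs = {nu: second_best_high_output(a, theta_low, theta_high, nu)
           for nu in (0.25, 0.5, 0.8)}
print(outputs)
```

In this example the inefficient type still produces when the efficient type is unlikely (probability 0.25), but is shut down once that probability reaches 0.5, because the expected rent saved then outweighs the surplus lost.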

13.8 The Theory of the Firm Under Asymmetric Information

When the delegation of task occurs within the firm, a major conclusion of the above 
analysis is that, because of asymmetric information, the firm does not maximize the social 
value of trade, or more precisely its profit, a maintained assumption of most economic 
theory. This lack of allocative efficiency should not be considered as a failure in the 
rational use of resources within the firm. Indeed, the point is that allocative efficiency 
is only one part of the principal's objective. The allocation of resources within the firm 
remains constrained optimal once informational constraints are fully taken into account. 

Williamson (1975) has advanced the view that various transaction costs may im- 
pede the achievement of economic transactions. Among the many origins of these costs, 
Williamson stresses informational impact as an important source of inefficiency. Even in 
a world with a costless enforcement of contracts, a major source of allocative inefficiency 
is the existence of asymmetric information between trading partners. 

Even though asymmetric information generates allocative inefficiencies, those
inefficiencies do not call for any public policy motivated by reasons of pure efficiency.
Indeed, any benevolent policymaker in charge of correcting these inefficiencies would face
the same informational constraints as the principal. The allocation obtained above is Pareto
optimal in the set of incentive feasible allocations, or incentive Pareto optimal.

13.9 Asymmetric Information and Marginal Cost Pricing

Under complete information, the first-best rules can be interpreted as price equal to 
marginal cost since consumers on the market will equate their marginal utility of con- 
sumption to price. 

Under asymmetric information, price equates marginal cost only when the producing
firm is efficient ($\theta = \underline{\theta}$). Using (13.29), we get the expression of the price $p(\bar{\theta})$ for the
inefficient type's output

$$p(\bar{\theta}) = \bar{\theta} + \frac{\nu}{1-\nu}\Delta\theta. \tag{13.32}$$

Price is higher than marginal cost in order to decrease the quantity $\bar{q}$ produced by the
inefficient firm and reduce the efficient firm's information rent. Alternatively, we can say
that price is equal to a generalized (or virtual) marginal cost that includes, in addition to
the traditional marginal cost of the inefficient type $\bar{\theta}$, an information cost that is worth
$\frac{\nu}{1-\nu}\Delta\theta$.

13.10 The Revelation Principle

In the above analysis, we have restricted the principal to offer a menu of contracts, one 
for each possible type. One may wonder if a better outcome could be achieved with a 
more complex contract allowing the agent possibly to choose among more options. The 
revelation principle ensures that there is no loss of generality in restricting the principal 
to offer simple menus having at most as many options as the cardinality of the type space. 
Those simple menus are actually examples of direct revelation mechanisms. 

Definition 13.10.1 A direct revelation mechanism is a mapping $g(\cdot)$ from $\Theta$ to $\mathcal{A}$ which
writes as $g(\theta) = (q(\theta), t(\theta))$ for all $\theta$ belonging to $\Theta$. The principal commits to offer the
transfer $t(\tilde{\theta})$ and the production level $q(\tilde{\theta})$ if the agent announces the value $\tilde{\theta}$, for any $\tilde{\theta}$
belonging to $\Theta$.

Definition 13.10.2 A direct revelation mechanism $g(\cdot)$ is truthful if it is incentive
compatible for the agent to announce his true type for any type, i.e., if the direct revelation
mechanism satisfies the following incentive compatibility constraints:

$$t(\underline{\theta}) - \underline{\theta}q(\underline{\theta}) \geq t(\bar{\theta}) - \underline{\theta}q(\bar{\theta}), \tag{13.33}$$

$$t(\bar{\theta}) - \bar{\theta}q(\bar{\theta}) \geq t(\underline{\theta}) - \bar{\theta}q(\underline{\theta}). \tag{13.34}$$

Denoting transfer and output for each possible report respectively as $t(\underline{\theta}) = \underline{t}$, $q(\underline{\theta}) = \underline{q}$,
$t(\bar{\theta}) = \bar{t}$ and $q(\bar{\theta}) = \bar{q}$, we get back to the notations of the previous sections.

A more general mechanism can be obtained when communication between the principal
and the agent is more complex than simply having the agent report his type to the
principal.

Let $M$ be the message space offered to the agent by a more general mechanism.

Definition 13.10.3 A mechanism is a message space $M$ and a mapping $\tilde{g}(\cdot)$ from $M$ to
$\mathcal{A}$ which writes as $\tilde{g}(m) = (\tilde{q}(m), \tilde{t}(m))$ for all $m$ belonging to $M$.

When facing such a mechanism, the agent with type $\theta$ chooses a best message $m^*(\theta)$ that
is implicitly defined as

$$\tilde{t}(m^*(\theta)) - \theta\tilde{q}(m^*(\theta)) \geq \tilde{t}(\tilde{m}) - \theta\tilde{q}(\tilde{m}) \quad \text{for all } \tilde{m} \in M. \tag{13.35}$$

The mechanism $(M, \tilde{g}(\cdot))$ therefore induces an allocation rule $a(\theta) = (\tilde{q}(m^*(\theta)), \tilde{t}(m^*(\theta)))$
mapping the set of types $\Theta$ into the set of allocations $\mathcal{A}$.

Then we have the following revelation principle in the one agent case. 

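Proposition 13.10.1 Any allocation rule $a(\theta)$ obtained with a mechanism $(M, \tilde{g}(\cdot))$ can
also be implemented with a truthful direct revelation mechanism.

Proof. The indirect mechanism $(M, \tilde{g}(\cdot))$ induces an allocation rule $a(\theta) = (\tilde{q}(m^*(\theta)),
\tilde{t}(m^*(\theta)))$ from $\Theta$ into $\mathcal{A}$. By composition of $\tilde{g}(\cdot)$ and $m^*(\cdot)$, we can construct a direct
revelation mechanism $g(\cdot)$ mapping $\Theta$ into $\mathcal{A}$, namely $g = \tilde{g} \circ m^*$, or more precisely
$g(\theta) = (q(\theta), t(\theta)) \equiv \tilde{g}(m^*(\theta)) = (\tilde{q}(m^*(\theta)), \tilde{t}(m^*(\theta)))$ for all $\theta \in \Theta$.

We check now that the direct revelation mechanism $g(\cdot)$ is truthful. Indeed, since
(13.35) is true for all $\tilde{m}$, it holds in particular for $\tilde{m} = m^*(\theta')$ for all $\theta' \in \Theta$. Thus we
have

$$\tilde{t}(m^*(\theta)) - \theta\tilde{q}(m^*(\theta)) \geq \tilde{t}(m^*(\theta')) - \theta\tilde{q}(m^*(\theta')) \quad \text{for all } (\theta, \theta') \in \Theta^2. \tag{13.36}$$

Finally, using the definition of $g(\cdot)$, we get

$$t(\theta) - \theta q(\theta) \geq t(\theta') - \theta q(\theta') \quad \text{for all } (\theta, \theta') \in \Theta^2. \tag{13.37}$$

Hence, the direct revelation mechanism $g(\cdot)$ is truthful.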
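The construction used in the proof can be sketched on a finite example. The message labels, allocations and type values below are hypothetical and chosen only for illustration; the code composes the agent's best-message map with an indirect mechanism and then checks the truthfulness inequalities (13.37).

```python
def best_message(theta, mechanism):
    """m*(theta): a message maximizing t(m) - theta*q(m), as in (13.35)."""
    return max(mechanism, key=lambda m: mechanism[m][1] - theta * mechanism[m][0])

def direct_mechanism(types, mechanism):
    """g = g~ o m*: each reported type receives the allocation its best message induces."""
    return {theta: mechanism[best_message(theta, mechanism)] for theta in types}

def is_truthful(direct):
    """Check (13.37): no type gains by announcing another type."""
    return all(
        direct[th][1] - th * direct[th][0] >= direct[rep][1] - th * direct[rep][0]
        for th in direct
        for rep in direct
    )

# Hypothetical indirect mechanism: message -> (output q, transfer t).
mechanism = {"a": (0.0, 0.0), "b": (1.0, 1.6), "c": (2.0, 2.8), "d": (3.0, 3.5)}
types = [1.0, 1.5]          # efficient and inefficient marginal costs (assumed values)

direct = direct_mechanism(types, mechanism)
print(direct, is_truthful(direct))
```

The direct mechanism has only as many options as there are types, even though the original message space has four messages.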

Importantly, the revelation principle provides a considerable simplification of contract
theory. It enables us to restrict the analysis to a simple and well-defined family of functions,
the truthful direct revelation mechanisms.

13.11 A More General Utility Function for the Agent

Still keeping quasi-linear utility functions, let $U = t - C(q, \theta)$ now be the agent's objective
function, under the assumptions $C_q > 0$, $C_\theta > 0$, $C_{qq} > 0$ and $C_{q\theta} > 0$. The generalization of
the Spence-Mirrlees property is now $C_{q\theta} > 0$. This latter condition still ensures that the
different types of the agent have indifference curves which cross each other at most once.
The interpretation of this Spence-Mirrlees property is quite clear: a more efficient type is
also more efficient at the margin.

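Incentive feasible allocations satisfy the following incentive and participation
constraints:

$$\underline{U} = \underline{t} - C(\underline{q}, \underline{\theta}) \geq \bar{t} - C(\bar{q}, \underline{\theta}), \tag{13.38}$$
$$\bar{U} = \bar{t} - C(\bar{q}, \bar{\theta}) \geq \underline{t} - C(\underline{q}, \bar{\theta}), \tag{13.39}$$
$$\underline{U} = \underline{t} - C(\underline{q}, \underline{\theta}) \geq 0, \tag{13.40}$$
$$\bar{U} = \bar{t} - C(\bar{q}, \bar{\theta}) \geq 0. \tag{13.41}$$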



13.11.1 The Optimal Contract 

Just as before, the incentive constraint of an efficient type in (13.38) and the participation
constraint of an inefficient type in (13.41) are the two relevant constraints for optimization.
These constraints rewrite respectively as

$$\underline{U} \geq \bar{U} + \Phi(\bar{q}), \tag{13.42}$$

where $\Phi(\bar{q}) = C(\bar{q}, \bar{\theta}) - C(\bar{q}, \underline{\theta})$ (with $\Phi' > 0$ and $\Phi'' > 0$), and

$$\bar{U} \geq 0. \tag{13.43}$$

Those constraints are both binding at the second-best optimum, which leads to the
following expression of the efficient type's rent

$$\underline{U} = \Phi(\bar{q}). \tag{13.44}$$

Since $\Phi' > 0$, reducing the inefficient agent's output also reduces the efficient agent's
information rent.

With the assumptions made on $C(\cdot)$, one can also check that the principal's objective
function is strictly concave with respect to outputs.

The solution of the principal's program can be summarized as follows:

Proposition 13.11.1 With general preferences satisfying the Spence-Mirrlees property,
$C_{q\theta} > 0$, the optimal menu of contracts entails:

(1) No output distortion with respect to the first-best outcome for the efficient
type, $\underline{q}^{SB} = \underline{q}^*$ with

$$S'(\underline{q}^*) = C_q(\underline{q}^*, \underline{\theta}). \tag{13.45}$$

A downward output distortion for the inefficient type, $\bar{q}^{SB} < \bar{q}^*$ with

$$S'(\bar{q}^*) = C_q(\bar{q}^*, \bar{\theta}) \tag{13.46}$$

and

$$S'(\bar{q}^{SB}) = C_q(\bar{q}^{SB}, \bar{\theta}) + \frac{\nu}{1-\nu}\Phi'(\bar{q}^{SB}). \tag{13.47}$$

(2) Only the efficient type gets a positive information rent given by
$\underline{U}^{SB} = \Phi(\bar{q}^{SB})$.

(3) The second-best transfers are respectively given by
$\underline{t}^{SB} = C(\underline{q}^*, \underline{\theta}) + \Phi(\bar{q}^{SB})$ and $\bar{t}^{SB} = C(\bar{q}^{SB}, \bar{\theta})$.
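A small numerical sketch may help to read Proposition 13.11.1. It assumes, purely for illustration, $S(q) = 2\sqrt{q}$ and $C(q, \theta) = \theta q^2/2$ (so that $C_{q\theta} = q > 0$), for which the first-order conditions (13.45) to (13.47) have closed forms; the names `theta_lo` and `theta_hi` stand for $\underline{\theta}$ and $\bar{\theta}$ and are hypothetical.

```python
def cost(q, theta):
    """Assumed cost function C(q, theta) = theta*q^2/2."""
    return 0.5 * theta * q * q

def phi(q, theta_lo, theta_hi):
    """Phi(q) = C(q, theta_hi) - C(q, theta_lo): the rent per (13.44)."""
    return cost(q, theta_hi) - cost(q, theta_lo)

def optimal_menu(theta_lo, theta_hi, nu):
    """Closed-form menu for S(q) = 2*sqrt(q), where S'(q) = q**-0.5."""
    d_theta = theta_hi - theta_lo
    q_lo = theta_lo ** (-2.0 / 3.0)                    # (13.45): q**-0.5 = theta_lo*q
    q_hi_fb = theta_hi ** (-2.0 / 3.0)                 # (13.46)
    virtual = theta_hi + nu / (1.0 - nu) * d_theta     # C_q + nu/(1-nu)*Phi' = virtual*q
    q_hi = virtual ** (-2.0 / 3.0)                     # (13.47)
    rent = phi(q_hi, theta_lo, theta_hi)               # item (2)
    t_lo = cost(q_lo, theta_lo) + rent                 # item (3)
    t_hi = cost(q_hi, theta_hi)
    return {"q_lo": q_lo, "q_hi": q_hi, "q_hi_fb": q_hi_fb,
            "rent": rent, "t_lo": t_lo, "t_hi": t_hi}

menu = optimal_menu(theta_lo=1.0, theta_hi=2.0, nu=0.5)
print(menu)
```

In this example the inefficient type's constraint (13.39) can be checked directly from the returned transfers, since the inefficient output lies below the efficient one.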

The first-order conditions (13.45) and (13.47) characterize the optimal solution if the
neglected incentive constraint (13.39) is satisfied. For this to be true, we need to have

$$\bar{t}^{SB} - C(\bar{q}^{SB}, \bar{\theta}) \geq \underline{t}^{SB} - C(\underline{q}^{SB}, \bar{\theta}) = \bar{t}^{SB} - C(\bar{q}^{SB}, \underline{\theta}) + C(\underline{q}^{SB}, \underline{\theta}) - C(\underline{q}^{SB}, \bar{\theta}) \tag{13.48}$$

by noting that (13.38) holds with equality at the optimal output, so that
$\underline{t}^{SB} = \bar{t}^{SB} - C(\bar{q}^{SB}, \underline{\theta}) + C(\underline{q}^{SB}, \underline{\theta})$. Thus, we need to have

$$0 \geq \Phi(\bar{q}^{SB}) - \Phi(\underline{q}^{SB}). \tag{13.49}$$

Since $\Phi' > 0$ from the Spence-Mirrlees property, (13.49) is equivalent to $\bar{q}^{SB} \leq \underline{q}^{SB}$.
But from our assumptions we easily derive that $\underline{q}^{SB} = \underline{q}^* > \bar{q}^* > \bar{q}^{SB}$. So the
Spence-Mirrlees property guarantees that only the efficient type's incentive compatibility
constraint has to be taken into account.

Remark 13.11.1 When asymmetric information and network externalities are jointly
present, the "no distortion at the top" rule may not hold, so that the most efficient agent
may not obtain the first-best outcome. Meng and Tian (2008) showed that the optimal
contract for the principal may exhibit two-way distortion: the output of any agent is
oversupplied (relative to the first-best) when his marginal cost of effort is low, and
undersupplied when his marginal cost of effort is high.

"Network externalities" might arise for any of the following reasons: because the
usefulness of the commodity depends directly on the size of the network (e.g., telephones,
fax machines); or because of a bandwagon effect, which means the desire to be in style, that
is, to have a good because almost everyone else has it; or indirectly through the availability
of complementary goods and services (often known as the "hardware-software paradigm")
or of postpurchase services (e.g., for automobiles). Although network externalities are
often regarded as having a positive impact on each other's consumption, they may be
negative in some cases. For example, people may desire to own exclusive or unique
goods, which is called the "snob effect". The quantity demanded of a "snob" good is higher
the fewer the people who own it.

13.11.2 More than One Good 

Let us now assume that the agent is producing a whole vector of goods $q = (q_1, \ldots, q_n)$ for
the principal. The agent's cost function becomes $C(q, \theta)$ with $C(\cdot)$ being strictly convex
in $q$. The value for the principal of consuming this whole bundle is now $S(q)$ with $S(\cdot)$
being strictly concave in $q$.

In this multi-output incentive problem, the principal is interested in a whole set of
activities carried out simultaneously by the agent. It is straightforward to check that the
efficient agent's information rent is now written as $\underline{U} = \Phi(\bar{q})$ with $\Phi(q) = C(q, \bar{\theta}) - C(q, \underline{\theta})$.
This leads to second-best optimal outputs. The efficient type produces the first-best vector
of outputs $\underline{q}^{SB} = \underline{q}^*$ with

$$S_{q_i}(\underline{q}^*) = C_{q_i}(\underline{q}^*, \underline{\theta}) \quad \text{for all } i \in \{1, \ldots, n\}. \tag{13.50}$$

The inefficient type's vector of outputs $\bar{q}^{SB}$ is instead characterized by the first-order
conditions

$$S_{q_i}(\bar{q}^{SB}) = C_{q_i}(\bar{q}^{SB}, \bar{\theta}) + \frac{\nu}{1-\nu}\Phi_{q_i}(\bar{q}^{SB}) \quad \text{for all } i \in \{1, \ldots, n\}, \tag{13.51}$$

which generalizes the distortion of models with a single good.

Without further specifying the value and cost functions, the second-best outputs define
a vector of outputs with some components $\bar{q}_i^{SB}$ above $\bar{q}_i^*$ for a subset of indices $i$.

Turning to incentive compatibility, summing the incentive constraints $\underline{U} \geq \bar{U} + \Phi(\bar{q})$
and $\bar{U} \geq \underline{U} - \Phi(\underline{q})$ for any incentive feasible contract yields

$$\Phi(\underline{q}) = C(\underline{q}, \bar{\theta}) - C(\underline{q}, \underline{\theta}) \tag{13.52}$$
$$\geq C(\bar{q}, \bar{\theta}) - C(\bar{q}, \underline{\theta}) = \Phi(\bar{q}) \quad \text{for all implementable pairs } (\bar{q}, \underline{q}). \tag{13.53}$$

Obviously, this condition is satisfied if the Spence-Mirrlees property $C_{q_i\theta} > 0$ holds for
each output $i$ and if the monotonicity conditions $\bar{q}_i < \underline{q}_i$ for all $i$ are satisfied.

13.12 Ex Ante versus Ex Post Participation Constraints

The contracts considered so far are offered at the interim stage, i.e., after the agent
already knows his type. However, sometimes the principal and the agent can contract at
the ex ante stage, i.e., before the agent discovers his type. For instance, the contracts
of the firm may be designed before the agent receives any piece of information on his
productivity. In this section, we characterize the optimal contract for this alternative
timing under various assumptions about the risk aversion of the two players.

13.12.1 Risk Neutrality 

Suppose that the principal and the agent meet and contract ex ante. If the agent is risk
neutral, his ex ante participation constraint is now written as

$$\nu\underline{U} + (1-\nu)\bar{U} \geq 0. \tag{13.54}$$

This ex ante participation constraint replaces the two interim participation constraints.

Since the principal's objective function is decreasing in the agent's expected information
rent, the principal wants to impose a zero expected rent on the agent and have (13.54)
be binding. Moreover, the principal must structure the rents $\underline{U}$ and $\bar{U}$ to ensure that the
two incentive constraints remain satisfied. An example of such a rent distribution that
is both incentive compatible and satisfies the ex ante participation constraint with an
equality is

$$\underline{U}^* = (1-\nu)\Delta\theta\,\bar{q}^* > 0 \quad \text{and} \quad \bar{U}^* = -\nu\Delta\theta\,\bar{q}^* < 0. \tag{13.55}$$

With such a rent distribution, the optimal contract implements the first-best outputs
without cost from the principal's point of view as long as the first-best is monotonic
as requested by the implementability condition. In the contract defined by (13.55), the
agent is rewarded when he is efficient and punished when he turns out to be inefficient.
In summary, we have

Proposition 13.12.1 When the agent is risk neutral and contracting takes place ex ante,
the optimal incentive contract implements the first-best outcome.

Remark 13.12.1 The principal has in fact more options in structuring the rents $\underline{U}$ and
$\bar{U}$ in such a way that the incentive compatibility constraints hold and the ex ante
participation constraint (13.54) holds with an equality. Consider the following contracts
$\{(\underline{t}^*, \underline{q}^*); (\bar{t}^*, \bar{q}^*)\}$ where $\underline{t}^* = S(\underline{q}^*) - T^*$ and $\bar{t}^* = S(\bar{q}^*) - T^*$, with $T^*$ being a lump-sum
payment to be defined below. This contract is incentive compatible since

$$\underline{t}^* - \underline{\theta}\underline{q}^* = S(\underline{q}^*) - \underline{\theta}\underline{q}^* - T^* > S(\bar{q}^*) - \underline{\theta}\bar{q}^* - T^* = \bar{t}^* - \underline{\theta}\bar{q}^* \tag{13.56}$$

by definition of $\underline{q}^*$, and

$$\bar{t}^* - \bar{\theta}\bar{q}^* = S(\bar{q}^*) - \bar{\theta}\bar{q}^* - T^* > S(\underline{q}^*) - \bar{\theta}\underline{q}^* - T^* = \underline{t}^* - \bar{\theta}\underline{q}^* \tag{13.57}$$

by definition of $\bar{q}^*$.

Note that the incentive compatibility constraints are now strict inequalities. Moreover,
the fixed fee $T^*$ can be used to satisfy the agent's ex ante participation constraint with an
equality by choosing $T^* = \nu(S(\underline{q}^*) - \underline{\theta}\underline{q}^*) + (1-\nu)(S(\bar{q}^*) - \bar{\theta}\bar{q}^*)$. This implementation of the
first-best outcome amounts to having the principal sell the benefit of the relationship
to the risk-neutral agent for a fixed up-front payment $T^*$. The agent benefits from the
full value of the good and trades off the value of any production against its cost just as
if he were an efficiency maximizer. We will say that the agent is residual claimant for the
firm's profit.
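The "selling the firm" contract of Remark 13.12.1 can be checked numerically. The following is a minimal sketch under an assumed surplus function $S(q) = 2\sqrt{q}$ and hypothetical parameter values; `theta_lo` and `theta_hi` stand for $\underline{\theta}$ and $\bar{\theta}$.

```python
import math

def surplus(q):
    """Assumed S(q) = 2*sqrt(q)."""
    return 2.0 * math.sqrt(q)

def first_best_output(theta):
    """Solves S'(q) = theta, i.e. 1/sqrt(q) = theta."""
    return 1.0 / theta ** 2

def sell_the_firm(theta_lo, theta_hi, nu):
    """Return ((t_lo, q_lo), (t_hi, q_hi), T*) for the contract of Remark 13.12.1."""
    q_lo, q_hi = first_best_output(theta_lo), first_best_output(theta_hi)
    fee = (nu * (surplus(q_lo) - theta_lo * q_lo)
           + (1.0 - nu) * (surplus(q_hi) - theta_hi * q_hi))
    return (surplus(q_lo) - fee, q_lo), (surplus(q_hi) - fee, q_hi), fee

(t_lo, q_lo), (t_hi, q_hi), fee = sell_the_firm(theta_lo=1.0, theta_hi=2.0, nu=0.4)
rent_lo = t_lo - 1.0 * q_lo      # ex post rent of the efficient type
rent_hi = t_hi - 2.0 * q_hi      # ex post rent of the inefficient type
print(fee, rent_lo, rent_hi, 0.4 * rent_lo + 0.6 * rent_hi)
```

In this example the efficient type ends up with a positive ex post rent and the inefficient type with a negative one, while the expected rent is zero up to rounding.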



13.12.2 Risk Aversion 

A Risk-Averse Agent 

The previous section has shown us that the implementation of the first-best is feasible
with risk neutrality. What happens if the agent is risk-averse?

Consider now a risk-averse agent with a Von Neumann-Morgenstern utility function
$u(\cdot)$ defined on his monetary gains $t - \theta q$, such that $u' > 0$, $u'' < 0$ and $u(0) = 0$. Again,
the contract between the principal and the agent is signed before the agent discovers
his type. The incentive compatibility constraints are unchanged but the agent's ex ante
participation constraint is now written as

$$\nu u(\underline{U}) + (1-\nu)u(\bar{U}) \geq 0. \tag{13.58}$$

As usual, one can check that the incentive compatibility constraint (13.22) for the inefficient
agent is slack (not binding) at the optimum, and thus the principal's program reduces now to

$$\max_{\{(\bar{U}, \bar{q}); (\underline{U}, \underline{q})\}} \nu(S(\underline{q}) - \underline{\theta}\underline{q} - \underline{U}) + (1-\nu)(S(\bar{q}) - \bar{\theta}\bar{q} - \bar{U}),$$

subject to (13.21) and (13.58).

We have the following proposition. 

Proposition 13.12.2 When the agent is risk-averse and contracting takes place ex ante,
the optimal menu of contracts entails:

(1) No output distortion for the efficient type, $\underline{q}^{SB} = \underline{q}^*$. A downward output
distortion for the inefficient type, $\bar{q}^{SB} < \bar{q}^*$, with

$$S'(\bar{q}^{SB}) = \bar{\theta} + \frac{\nu\left(u'(\bar{U}^{SB}) - u'(\underline{U}^{SB})\right)}{\nu u'(\underline{U}^{SB}) + (1-\nu)u'(\bar{U}^{SB})}\Delta\theta. \tag{13.59}$$

(2) Both (13.21) and (13.58) are the only binding constraints. The efficient
(resp. inefficient) type gets a strictly positive (resp. negative) ex post
information rent, $\underline{U}^{SB} > 0 > \bar{U}^{SB}$.

Proof: Define the following Lagrangian for the principal's problem

$$L(\underline{q}, \bar{q}, \underline{U}, \bar{U}, \lambda, \mu) = \nu(S(\underline{q}) - \underline{\theta}\underline{q} - \underline{U}) + (1-\nu)(S(\bar{q}) - \bar{\theta}\bar{q} - \bar{U}) + \lambda(\underline{U} - \bar{U} - \Delta\theta\,\bar{q}) + \mu(\nu u(\underline{U}) + (1-\nu)u(\bar{U})). \tag{13.60}$$

Optimizing w.r.t. $\underline{U}$ and $\bar{U}$ yields respectively

$$-\nu + \lambda + \mu\nu u'(\underline{U}^{SB}) = 0, \tag{13.61}$$

$$-(1-\nu) - \lambda + \mu(1-\nu)u'(\bar{U}^{SB}) = 0. \tag{13.62}$$

Summing the above two equations, we obtain

$$\mu\left(\nu u'(\underline{U}^{SB}) + (1-\nu)u'(\bar{U}^{SB})\right) = 1, \tag{13.63}$$

and thus $\mu > 0$. Using (13.63) and inserting it into (13.61) yields

$$\lambda = \frac{\nu(1-\nu)\left(u'(\bar{U}^{SB}) - u'(\underline{U}^{SB})\right)}{\nu u'(\underline{U}^{SB}) + (1-\nu)u'(\bar{U}^{SB})}. \tag{13.64}$$

Moreover, (13.21) implies that $\underline{U}^{SB} \geq \bar{U}^{SB}$ and thus $\lambda \geq 0$, with $\lambda > 0$ for a positive
output $\bar{q}$.

Optimizing with respect to outputs yields respectively

$$S'(\underline{q}^{SB}) = \underline{\theta} \tag{13.65}$$

and

$$S'(\bar{q}^{SB}) = \bar{\theta} + \frac{\lambda}{1-\nu}\Delta\theta. \tag{13.66}$$

Simplifying by using (13.64) yields (13.59).

Thus, with risk aversion, the principal can no longer costlessly structure the agent's
information rents to ensure the efficient type's incentive compatibility constraint. Creating
a wedge between $\underline{U}$ and $\bar{U}$ to satisfy (13.21) makes the risk-averse agent bear some risk.
To guarantee the participation of the risk-averse agent, the principal must now pay a risk
premium. Reducing this premium calls for a downward reduction in the inefficient type's
output so that the risk borne by the agent is lower. As expected, the agent's risk aversion
leads the principal to weaken the incentives.

When the agent becomes infinitely risk averse, everything happens as if he had an ex
post individual rationality constraint for the worst state of the world given by (13.24).
In the limit, the inefficient agent's output $\bar{q}^{SB}$ and the utility levels $\underline{U}^{SB}$ and $\bar{U}^{SB}$ all
converge toward the same solution. So, the previous model at the interim stage can also
be interpreted as a model with an ex ante infinitely risk-averse agent at the zero utility level.
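The two limits discussed above can be seen in a numerical sketch. It assumes, for illustration only, $S(q) = 2\sqrt{q}$ and a CARA utility $u(x) = (1 - e^{-rx})/r$; under these assumptions, with (13.21) and (13.58) binding, $\bar{U}$ has a closed form and (13.59) reduces to one equation in $\bar{q}$, solved here by bisection. All function names are hypothetical.

```python
import math

def distortion_weight(q_bar, d_theta, nu, r):
    """Coefficient of d_theta in (13.59) for CARA utility when (13.21) binds."""
    ratio = math.exp(-r * d_theta * q_bar)      # u'(U_low) / u'(U_bar)
    return nu * (1.0 - ratio) / (nu * ratio + 1.0 - nu)

def solve_q_bar(theta_bar, d_theta, nu, r, tol=1e-12):
    """Bisection on S'(q) - theta_bar - weight(q)*d_theta = 0 with S'(q) = 1/sqrt(q)."""
    def gap(q):
        return 1.0 / math.sqrt(q) - theta_bar - distortion_weight(q, d_theta, nu, r) * d_theta
    lo, hi = 1e-12, 1.0 / theta_bar ** 2        # gap(lo) > 0 and gap(first best) <= 0
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if gap(mid) > 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

def ex_post_rents(q_bar, d_theta, nu, r):
    """(U_low, U_bar) solving (13.21) and (13.58) with equality for CARA utility."""
    u_bar = math.log(nu * math.exp(-r * d_theta * q_bar) + 1.0 - nu) / r
    return u_bar + d_theta * q_bar, u_bar

for r in (1e-6, 1.0, 200.0):
    print(r, solve_q_bar(theta_bar=2.0, d_theta=1.0, nu=0.5, r=r))
```

For these hypothetical parameters the first-best output of the inefficient type is $1/4$ and the interim second-best output is $1/9$; the computed output moves from the former toward the latter as $r$ grows.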

A Risk-Averse Principal 

Consider now a risk-averse principal with a Von Neumann-Morgenstern utility function
$v(\cdot)$ defined on his monetary gains from trade $S(q) - t$, such that $v' > 0$, $v'' < 0$ and
$v(0) = 0$. Again, the contract between the principal and the risk-neutral agent is signed
before the agent knows his type.

In this context, the first-best contract obviously calls for the first-best outputs $\underline{q}^*$ and
$\bar{q}^*$ being produced. It also calls for the principal to be fully insured between both states
of nature and for the agent's ex ante participation constraint to be binding. This leads
us to the following two conditions that must be satisfied by the agent's rents $\underline{U}^*$ and $\bar{U}^*$:

$$S(\underline{q}^*) - \underline{\theta}\underline{q}^* - \underline{U}^* = S(\bar{q}^*) - \bar{\theta}\bar{q}^* - \bar{U}^* \tag{13.67}$$

and

$$\nu\underline{U}^* + (1-\nu)\bar{U}^* = 0. \tag{13.68}$$

Solving this system of two equations with two unknowns $(\underline{U}^*, \bar{U}^*)$ yields

$$\underline{U}^* = (1-\nu)\left(S(\underline{q}^*) - \underline{\theta}\underline{q}^* - (S(\bar{q}^*) - \bar{\theta}\bar{q}^*)\right) \tag{13.69}$$

and

$$\bar{U}^* = -\nu\left(S(\underline{q}^*) - \underline{\theta}\underline{q}^* - (S(\bar{q}^*) - \bar{\theta}\bar{q}^*)\right). \tag{13.70}$$

Note that the first-best profile of information rents satisfies both types' incentive
compatibility constraints since

$$\underline{U}^* - \bar{U}^* = S(\underline{q}^*) - \underline{\theta}\underline{q}^* - (S(\bar{q}^*) - \bar{\theta}\bar{q}^*) > \Delta\theta\,\bar{q}^* \tag{13.71}$$

(from the definition of $\underline{q}^*$) and

$$\bar{U}^* - \underline{U}^* = S(\bar{q}^*) - \bar{\theta}\bar{q}^* - (S(\underline{q}^*) - \underline{\theta}\underline{q}^*) > -\Delta\theta\,\underline{q}^* \tag{13.72}$$

(from the definition of $\bar{q}^*$). Hence, the profile of rents $(\underline{U}^*, \bar{U}^*)$ is incentive compatible and
the first-best allocation is easily implemented in this framework. We can thus generalize
the proposition for the risk-neutral case as follows:

Proposition 13.12.3 When the principal is risk-averse over the monetary gains $S(q) - t$,
the agent is risk-neutral, and contracting takes place ex ante, the optimal incentive contract
implements the first-best outcome.

Remark 13.12.2 It is interesting to note that $\underline{U}^*$ and $\bar{U}^*$ obtained in (13.69) and (13.70)
are also the levels of rent obtained in (13.56) and (13.57). Indeed, the lump-sum payment
$T^* = \nu(S(\underline{q}^*) - \underline{\theta}\underline{q}^*) + (1-\nu)(S(\bar{q}^*) - \bar{\theta}\bar{q}^*)$, which allows the principal to make the
risk-neutral agent residual claimant for the hierarchy's profit, also provides full insurance
to the principal. By making the risk-neutral agent the residual claimant for the value
of trade, ex ante contracting allows the risk-averse principal to get full insurance and
implement the first-best outcome despite the informational problem.

Of course this result does not hold anymore if the agent's interim participation
constraints must be satisfied. In this case, we still guess a solution such that (13.23) is slack
at the optimum. The principal's program now reduces to:

$$\max_{\{(\bar{U}, \bar{q}); (\underline{U}, \underline{q})\}} \nu v(S(\underline{q}) - \underline{\theta}\underline{q} - \underline{U}) + (1-\nu)v(S(\bar{q}) - \bar{\theta}\bar{q} - \bar{U})$$

subject to (13.21) to (13.24).

Inserting the values of $\underline{U}$ and $\bar{U}$ that were obtained from the binding constraints in
(13.21) and (13.24) into the principal's objective function and optimizing with respect to
outputs leads to $\underline{q}^{SB} = \underline{q}^*$, i.e., no distortion for the efficient type, just as in the case of
risk neutrality, and a downward distortion of the inefficient type's output $\bar{q}^{SB} < \bar{q}^*$ given
by

$$S'(\bar{q}^{SB}) = \bar{\theta} + \frac{\nu v'(\underline{V}^{SB})}{(1-\nu)v'(\bar{V}^{SB})}\Delta\theta, \tag{13.73}$$

where $\underline{V}^{SB} = S(\underline{q}^*) - \underline{\theta}\underline{q}^* - \Delta\theta\,\bar{q}^{SB}$ and $\bar{V}^{SB} = S(\bar{q}^{SB}) - \bar{\theta}\bar{q}^{SB}$ are the principal's payoffs in
both states of nature. We can check that $\bar{V}^{SB} < \underline{V}^{SB}$ since $S(\bar{q}^{SB}) - \underline{\theta}\bar{q}^{SB} < S(\underline{q}^*) - \underline{\theta}\underline{q}^*$
from the definition of $\underline{q}^*$. In particular, we observe that the distortion in the right-hand
side of (13.73) is always lower than $\frac{\nu}{1-\nu}\Delta\theta$, its value with a risk-neutral principal. The
intuition is straightforward. By increasing $\bar{q}$ above its value with risk neutrality, the
risk-averse principal reduces the difference between $\underline{V}^{SB}$ and $\bar{V}^{SB}$. This gives the principal
some insurance and increases his ex ante payoff.

For example, if $v(x) = \frac{1 - e^{-rx}}{r}$, (13.73) becomes $S'(\bar{q}^{SB}) = \bar{\theta} + \frac{\nu}{1-\nu}e^{r(\bar{V}^{SB} - \underline{V}^{SB})}\Delta\theta$. If
$r = 0$, we get back the distortion obtained before for the case with a risk-neutral
principal and interim participation constraints for the agent. Since $\bar{V}^{SB} < \underline{V}^{SB}$, we
observe that the first-best is implemented when $r$ goes to infinity. In the limit, the
infinitely risk-averse principal is only interested in the inefficient state of nature for which
he wants to maximize the surplus, since there is no rent for the inefficient agent. Moreover,
giving a rent to the efficient agent is now without cost for the principal.
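The CARA example can be solved numerically to see how the distortion shrinks as $r$ increases. This is a sketch under the additional assumption $S(q) = 2\sqrt{q}$, with hypothetical parameter values; `theta_lo` and `theta_hi` stand for $\underline{\theta}$ and $\bar{\theta}$.

```python
import math

def solve_q_bar(theta_lo, theta_hi, nu, r, tol=1e-12):
    """Bisection on (13.73) with CARA v and the assumed S(q) = 2*sqrt(q)."""
    d_theta = theta_hi - theta_lo
    first_best_surplus_lo = 1.0 / theta_lo      # S(q*) - theta_lo*q* at q* = 1/theta_lo**2

    def gap(q):
        v_bar = 2.0 * math.sqrt(q) - theta_hi * q        # principal's payoff, inefficient state
        v_low = first_best_surplus_lo - d_theta * q      # principal's payoff, efficient state
        weight = nu / (1.0 - nu) * math.exp(r * (v_bar - v_low))
        return 1.0 / math.sqrt(q) - theta_hi - weight * d_theta

    lo, hi = 1e-12, 1.0 / theta_hi ** 2         # gap(lo) > 0 and gap(first best) < 0
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if gap(mid) > 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

for r in (0.0, 5.0, 200.0):
    print(r, solve_q_bar(theta_lo=1.0, theta_hi=2.0, nu=0.5, r=r))
```

With these numbers the output at $r = 0$ equals the risk-neutral second-best level $1/9$, and it approaches the first-best level $1/4$ as $r$ becomes large.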

Risk aversion on the side of the principal is quite natural in some contexts. A local 
regulator with a limited budget or a specialized bank dealing with relatively correlated 
projects may be insufficiently diversified to become completely risk neutral. See Lewis and 
Sappington (Rand J. Econ, 1995) for an application to the regulation of public utilities. 

13.13 Commitment 

To solve the incentive problem, we have implicitly assumed that the principal has a strong 
ability to commit himself not only to a distribution of rents that will induce information 
revelation but also to some allocative inefficiency designed to reduce the cost of this 
revelation. Alternatively, this assumption also means that the court of law can perfectly 
enforce the contract and that neither renegotiating nor reneging on the contract is a 
feasible alternative for the agent and (or) the principal. What can happen when either of 
those two assumptions is relaxed? 

13.13.1 Renegotiating a Contract 

Renegotiation is a voluntary act that should benefit both the principal and the agent. It 
should be contrasted with a breach of contract, which can hurt one of the contracting 
parties. One should view a renegotiation procedure as the ability of the contracting 
partners to achieve a Pareto improving trade if any becomes incentive feasible along the 
course of actions. 

Once the different types have revealed themselves to the principal by selecting the
contracts $(\underline{t}^{SB}, \underline{q}^{SB})$ for the efficient type and $(\bar{t}^{SB}, \bar{q}^{SB})$ for the inefficient type, the principal
may propose a renegotiation to get around the allocative inefficiency he has imposed on
the inefficient agent's output. The gain from this renegotiation comes from raising
allocative efficiency for the inefficient type and moving output from $\bar{q}^{SB}$ to $\bar{q}^*$. To share these
new gains from trade with the inefficient agent, the principal must at least offer him the
same utility level as before renegotiation. The participation constraint of the inefficient
agent can still be kept at zero when the transfer of this type is raised from $\bar{t}^{SB} = \bar{\theta}\bar{q}^{SB}$ to
$\bar{t}^* = \bar{\theta}\bar{q}^*$. However, raising this transfer also hardens the ex ante incentive compatibility
constraint of the efficient type. Indeed, it becomes more valuable for an efficient type
to hide his type so that he can obtain this larger transfer, and truthful revelation by
the efficient type is no longer obtained in equilibrium. There is a fundamental trade-off
between raising efficiency ex post and hardening ex ante incentives when renegotiation is
an issue.

13.13.2 Reneging on a Contract 

A second source of imperfection arises when either the principal or the agent reneges on 
their previous contractual obligation. Let us take the case of the principal reneging on 
the contract. Indeed, once the agent has revealed himself to the principal by selecting the 
contract within the menu offered by the principal, the latter, having learned the agent's 
type, might propose the complete information contract which extracts all rents without 
inducing inefficiency. On the other hand, the agent may want to renege on a contract
which gives him a negative ex post utility level as we discussed before. In this case, the
threat of the agent reneging on a contract signed at the ex ante stage forces the agent's
participation constraints to be written in interim terms. Such a setting justifies the focus
on the case of interim contracting.

13.14 Informative Signals to Improve Contracting 

In this section, we investigate the impacts of various improvements of the principal's
information system on the optimal contract. The idea here is to see how signals that are
exogenous to the relationship can be used by the principal to better design the contract
with the agent.

13.14.1 Ex Post Verifiable Signal

Suppose that the principal, the agent and the court of law observe ex post a verifiable signal
$\sigma$ which is correlated with $\theta$. This signal is observed after the agent's choice of production.
The contract can then be conditioned on both the agent's report and the observed signal
that provides useful information on the underlying state of nature.

For simplicity, assume that this signal may take only two values, $\sigma_1$ and $\sigma_2$. Let the
conditional probabilities of these respective realizations of the signal be
$\mu_1 = \Pr(\sigma = \sigma_1 \mid \theta = \underline{\theta}) \geq 1/2$ and $\mu_2 = \Pr(\sigma = \sigma_2 \mid \theta = \bar{\theta}) \geq 1/2$. Note that, if $\mu_1 = \mu_2 = 1/2$,
the signal $\sigma$ is uninformative. Otherwise, $\sigma_1$ brings good news about the fact that the agent is
efficient and $\sigma_2$ brings bad news, since it is more likely that the agent is inefficient in this
case.

Let us adopt the following notations for the ex post information rents:
$u_{11} = t(\underline{\theta}, \sigma_1) - \underline{\theta}q(\underline{\theta}, \sigma_1)$, $u_{12} = t(\underline{\theta}, \sigma_2) - \underline{\theta}q(\underline{\theta}, \sigma_2)$, $u_{21} = t(\bar{\theta}, \sigma_1) - \bar{\theta}q(\bar{\theta}, \sigma_1)$, and
$u_{22} = t(\bar{\theta}, \sigma_2) - \bar{\theta}q(\bar{\theta}, \sigma_2)$. Similar notations are used for the outputs $q_{ij}$. The agent
discovers his type and plays the mechanism before the signal $\sigma$ realizes. Then the incentive
and participation constraints must be written in expectation over the realization of $\sigma$.
Incentive constraints for both types write respectively as

$$\mu_1 u_{11} + (1-\mu_1)u_{12} \geq \mu_1(u_{21} + \Delta\theta\,q_{21}) + (1-\mu_1)(u_{22} + \Delta\theta\,q_{22}), \tag{13.74}$$

$$(1-\mu_2)u_{21} + \mu_2 u_{22} \geq (1-\mu_2)(u_{11} - \Delta\theta\,q_{11}) + \mu_2(u_{12} - \Delta\theta\,q_{12}). \tag{13.75}$$

Participation constraints for both types are written as

$$\mu_1 u_{11} + (1-\mu_1)u_{12} \geq 0, \tag{13.76}$$

$$(1-\mu_2)u_{21} + \mu_2 u_{22} \geq 0. \tag{13.77}$$

Note that, for a given schedule of output $q_{ij}$, the system (13.74) through (13.77) has as
many equations as unknowns $u_{ij}$. When the determinant of the coefficient matrix of the
system (13.74) to (13.77) is nonzero, one can find ex post rents (or equivalent transfers)
such that all these constraints are binding. In this case, the agent receives no rent whatever
his type. Moreover, any choice of production levels, in particular the complete information
optimal ones, can be implemented this way. Note that the determinant of the system is
nonzero when

$$1 - \mu_1 - \mu_2 \neq 0, \tag{13.78}$$

a condition that fails only if $\mu_1 = \mu_2 = 1/2$, which corresponds to the case of an uninformative and
useless signal.
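To make the rank condition concrete, the sketch below solves (13.74) to (13.77) with equality for the ex post rents $u_{ij}$. Substituting the binding participation constraints into the incentive constraints splits the system into two $2 \times 2$ blocks with the same coefficient matrix, whose determinant is $\mu_1 + \mu_2 - 1$. The output schedule and parameter values are hypothetical.

```python
def solve_2x2(a, b, c, d, e, f):
    """Solve a*x + b*y = e and c*x + d*y = f by Cramer's rule."""
    det = a * d - b * c
    if abs(det) < 1e-12:
        raise ValueError("uninformative signal: 1 - mu1 - mu2 = 0")
    return (e * d - b * f) / det, (a * f - e * c) / det

def binding_rents(mu1, mu2, d_theta, q):
    """q[(i, j)] is the output of type i (1 = efficient) after signal j; returns u[(i, j)]."""
    # (13.74) with (13.76) binding, together with (13.77): unknowns u21, u22.
    rhs_ineff = -d_theta * (mu1 * q[2, 1] + (1.0 - mu1) * q[2, 2])
    u21, u22 = solve_2x2(mu1, 1.0 - mu1, 1.0 - mu2, mu2, rhs_ineff, 0.0)
    # (13.76), together with (13.75) with (13.77) binding: unknowns u11, u12.
    rhs_eff = d_theta * ((1.0 - mu2) * q[1, 1] + mu2 * q[1, 2])
    u11, u12 = solve_2x2(mu1, 1.0 - mu1, 1.0 - mu2, mu2, 0.0, rhs_eff)
    return {(1, 1): u11, (1, 2): u12, (2, 1): u21, (2, 2): u22}

# Hypothetical first-best outputs that do not depend on the signal.
q = {(1, 1): 1.0, (1, 2): 1.0, (2, 1): 0.25, (2, 2): 0.25}
u = binding_rents(mu1=0.8, mu2=0.7, d_theta=1.0, q=q)
print(u)
```

Each type's expected rent is zero by construction, while the ex post rents reward the agent when the signal agrees with his report and punish him otherwise.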



13.14.2 Ex Ante Nonverifiable Signal 

Now suppose that a nonverifiable binary signal $\sigma$ about $\theta$ is available to the principal at
the ex ante stage. Before offering an incentive contract, the principal computes, using
Bayes' law, his posterior belief that the agent is efficient for each value of this signal,
namely

$$\hat{\nu}_1 = \Pr(\theta = \underline{\theta} \mid \sigma = \sigma_1) = \frac{\nu\mu_1}{\nu\mu_1 + (1-\nu)(1-\mu_2)}, \tag{13.79}$$

$$\hat{\nu}_2 = \Pr(\theta = \underline{\theta} \mid \sigma = \sigma_2) = \frac{\nu(1-\mu_1)}{\nu(1-\mu_1) + (1-\nu)\mu_2}. \tag{13.80}$$

Then the optimal contract entails a downward distortion of the inefficient agent's
production $\bar{q}^{SB}(\sigma_i)$, which is for signals $\sigma_1$ and $\sigma_2$ respectively:

$$S'(\bar{q}^{SB}(\sigma_1)) = \bar{\theta} + \frac{\hat{\nu}_1}{1-\hat{\nu}_1}\Delta\theta = \bar{\theta} + \frac{\nu\mu_1}{(1-\nu)(1-\mu_2)}\Delta\theta, \tag{13.81}$$

$$S'(\bar{q}^{SB}(\sigma_2)) = \bar{\theta} + \frac{\hat{\nu}_2}{1-\hat{\nu}_2}\Delta\theta = \bar{\theta} + \frac{\nu(1-\mu_1)}{(1-\nu)\mu_2}\Delta\theta. \tag{13.82}$$

In the case where $\mu_1 = \mu_2 = \mu > 1/2$, we can interpret $\mu$ as an index of the informativeness
of the signal. Observing $\sigma_1$, the principal thinks that it is more likely that the agent is
efficient. A stronger reduction in $\bar{q}$ and thus in the efficient type's information rent is
called for after $\sigma_1$. (13.81) shows that incentives decrease with respect to the case without
informative signal since $\frac{\mu}{1-\mu} > 1$. In particular, if $\mu$ is large enough, the principal shuts
down the inefficient firm after having observed $\sigma_1$. The principal offers a high-powered
incentive contract only to the efficient agent, which leaves him with no rent. On the
contrary, because he is less likely to face an efficient type after having observed $\sigma_2$, the
principal reduces less of the information rent than in the case without an informative
signal since $\frac{1-\mu}{\mu} < 1$. Incentives are stronger.
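The Bayesian updating in (13.79) and (13.80) and the resulting distortion terms in (13.81) and (13.82) can be computed directly. The sketch below uses hypothetical parameter values and function names.

```python
def posteriors(nu, mu1, mu2):
    """Posterior probability of the efficient type after sigma_1 and after sigma_2."""
    nu_hat_1 = nu * mu1 / (nu * mu1 + (1.0 - nu) * (1.0 - mu2))              # (13.79)
    nu_hat_2 = nu * (1.0 - mu1) / (nu * (1.0 - mu1) + (1.0 - nu) * mu2)      # (13.80)
    return nu_hat_1, nu_hat_2

def distortion(belief, d_theta):
    """Information cost added to the inefficient type's marginal cost."""
    return belief / (1.0 - belief) * d_theta

nu, mu, d_theta = 0.5, 0.8, 1.0
nu_hat_1, nu_hat_2 = posteriors(nu, mu, mu)
print(nu_hat_1, nu_hat_2)
print(distortion(nu_hat_1, d_theta), distortion(nu, d_theta), distortion(nu_hat_2, d_theta))
```

With these numbers the distortion after $\sigma_1$ exceeds the no-signal distortion, which in turn exceeds the distortion after $\sigma_2$, in line with the discussion above.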

13.15 Contract Theory at Work 

This section proposes several classical settings where the basic model of this chapter
is useful. Introducing adverse selection in each of these contexts has proved to be a
significant improvement of standard microeconomic analysis.

13.15.1 Regulation

In the Baron and Myerson (Econometrica, 1982) regulation model, the principal is a
regulator who maximizes a weighted average of the agents' surplus $S(q) - t$ and of a
regulated monopoly's profit $U = t - \theta q$, with a weight $\alpha < 1$ for the firm's profit. The
principal's objective function is written now as $V = S(q) - \theta q - (1-\alpha)U$. Because $\alpha < 1$,
it is socially costly to give up a rent to the firm. Maximizing expected social welfare
under incentive and participation constraints leads to $\underline{q}^{SB} = \underline{q}^*$ for the efficient type and
a downward distortion for the inefficient type, $\bar{q}^{SB} < \bar{q}^*$, which is given by

$$S'(\bar{q}^{SB}) = \bar{\theta} + \frac{\nu}{1-\nu}(1-\alpha)\Delta\theta. \tag{13.83}$$

Note that a higher value of $\alpha$ reduces the output distortion, because the regulator is
less concerned by the distribution of rents within society as $\alpha$ increases. If $\alpha = 1$, the
firm's rent is no longer costly and the regulator behaves as a pure efficiency maximizer
implementing the first-best output in all states of nature.
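The comparative statics in $\alpha$ can be illustrated with a short sketch. It assumes $S(q) = 2\sqrt{q}$, so that (13.83) has a closed-form solution; the function name and parameter values are hypothetical.

```python
def regulated_output(theta_bar, d_theta, nu, alpha):
    """Solve (13.83) for the assumed S(q) = 2*sqrt(q), where S'(q) = 1/sqrt(q)."""
    marginal = theta_bar + nu / (1.0 - nu) * (1.0 - alpha) * d_theta
    return 1.0 / marginal ** 2

# Output of the inefficient firm for increasing weights on the firm's profit.
for alpha in (0.0, 0.5, 1.0):
    print(alpha, regulated_output(theta_bar=2.0, d_theta=1.0, nu=0.5, alpha=alpha))
```

At $\alpha = 1$ the function returns the first-best output $1/\bar{\theta}^2$ of this example, and at $\alpha = 0$ it returns the second-best output of the basic model.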

The regulation literature of the last thirty years has greatly improved our understand- 
ing of government intervention under asymmetric information. We refer to the book of 
Laffont and Tirole (1993) for a comprehensive view of this theory and its various impli- 
cations for the design of real world regulatory institutions. 

13.15.2 Nonlinear Pricing by a Monopoly 

In Maskin and Riley (Rand J. of Economics, 1984), the principal is the seller of a private
good with production cost $cq$ who faces a continuum of buyers. The principal thus has a
utility function $V = t - cq$. The tastes of a buyer for the private good are such that his
utility function is $U = \theta u(q) - t$, where $q$ is the quantity consumed and $t$ his payment to
the principal. Suppose that the parameter $\theta$ of each buyer is drawn independently from
the same distribution on $\Theta = \{\underline{\theta}, \bar{\theta}\}$ with respective probabilities $1 - \nu$ and $\nu$.

We are now in a setting with a continuum of agents. However, it is mathematically
equivalent to the framework with a single agent. Now $\nu$ is the frequency of type $\bar{\theta}$ by the
Law of Large Numbers.

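Incentive and participation constraints can as usual be written directly in terms of the
information rents $\underline{U} = \underline{\theta} u(\underline{q}) - \underline{t}$ and $\bar{U} = \bar{\theta} u(\bar{q}) - \bar{t}$ as

$$\underline{U} \geq \bar{U} - \Delta\theta\, u(\bar{q}), \tag{13.84}$$

$$\bar{U} \geq \underline{U} + \Delta\theta\, u(\underline{q}), \tag{13.85}$$

$$\underline{U} \geq 0, \tag{13.86}$$

$$\bar{U} \geq 0. \tag{13.87}$$

The principal's program now takes the following form:

$$\max_{\{(\bar{U},\bar{q});(\underline{U},\underline{q})\}} \nu\left(\bar{\theta} u(\bar{q}) - c\bar{q}\right) + (1-\nu)\left(\underline{\theta} u(\underline{q}) - c\underline{q}\right) - \left(\nu\bar{U} + (1-\nu)\underline{U}\right)$$

subject to (13.84) to (13.87).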

The analysis is the mirror image of that of the standard model discussed before, where
now the efficient type is the one with the highest valuation for the good, $\bar{\theta}$. Hence, (13.85)
and (13.86) are the two binding constraints. As a result, there is no output distortion
with respect to the first-best outcome for the high valuation type and $\bar{q}^{SB} = \bar{q}^*$, where
$\bar{\theta} u'(\bar{q}^*) = c$. However, there exists a downward distortion of the low valuation agent's
output with respect to the first-best outcome. We have $\underline{q}^{SB} < \underline{q}^*$, where

$$\left(\underline{\theta} - \frac{\nu}{1-\nu}\Delta\theta\right) u'(\underline{q}^{SB}) = c \quad\text{and}\quad \underline{\theta} u'(\underline{q}^*) = c. \tag{13.88}$$

So the unit price is not the same if the buyers demand $\bar{q}^*$ or $\underline{q}^{SB}$, hence the term
nonlinear prices.
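
To make the last remark concrete, here is a minimal sketch that computes the two-bundle menu under the assumed utility $u(q) = 2\sqrt{q}$. The functional form, the parameter values, and the name `nonlinear_pricing` are hypothetical choices for illustration. It uses the binding constraints (13.85) and (13.86) to recover the payments and then compares the unit prices $t/q$ of the two bundles.

```python
import math


def nonlinear_pricing(theta_low, theta_high, nu, c):
    """Two-type screening menu under the ASSUMED utility u(q) = 2*sqrt(q)."""
    def u(q):
        return 2.0 * math.sqrt(q)

    delta_theta = theta_high - theta_low
    # High-valuation type: theta_high * u'(q) = c, with u'(q) = 1/sqrt(q).
    q_high = (theta_high / c) ** 2
    # Low-valuation type: a virtual valuation replaces theta_low.
    virtual = theta_low - nu / (1.0 - nu) * delta_theta
    if virtual <= 0:
        # This sketch does not handle the case where the low type is excluded.
        raise ValueError("this sketch assumes a positive virtual valuation")
    q_low = (virtual / c) ** 2
    # Binding (13.86): the low type's rent is zero.
    t_low = theta_low * u(q_low)
    # Binding (13.85): the high type's rent is delta_theta * u(q_low).
    t_high = theta_high * u(q_high) - delta_theta * u(q_low)
    return (q_low, t_low), (q_high, t_high)


(q_low, t_low), (q_high, t_high) = nonlinear_pricing(2.0, 3.0, 0.25, 1.0)
print("unit price, low bundle :", t_low / q_low)
print("unit price, high bundle:", t_high / q_high)
```

In this assumed example the larger bundle carries the lower unit price, which is one way a nonlinear tariff can look in practice.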

13.15.3 Quality and Price Discrimination 

Mussa and Rosen (JET, 1978) studied a problem very similar to nonlinear pricing,
where agents buy one unit of a commodity with quality $q$ but are vertically differentiated
with respect to their preferences for the good. The marginal cost (and average cost)
of producing one unit of quality $q$ is $C(q)$ and the principal has the payoff function
$V = t - C(q)$. The payoff function of an agent is now $U = \theta q - t$ with $\theta$ in $\Theta = \{\underline{\theta}, \bar{\theta}\}$,
with respective probabilities $1 - \nu$ and $\nu$.

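Incentive and participation constraints can still be written directly in terms of the
information rents $\underline{U} = \underline{\theta}\underline{q} - \underline{t}$ and $\bar{U} = \bar{\theta}\bar{q} - \bar{t}$ as

$$\underline{U} \geq \bar{U} - \Delta\theta\,\bar{q}, \tag{13.89}$$

$$\bar{U} \geq \underline{U} + \Delta\theta\,\underline{q}, \tag{13.90}$$

$$\underline{U} \geq 0, \tag{13.91}$$

$$\bar{U} \geq 0. \tag{13.92}$$

The principal now solves:

$$\max_{\{(\bar{U},\bar{q});(\underline{U},\underline{q})\}} \nu\left(\bar{\theta}\bar{q} - C(\bar{q})\right) + (1-\nu)\left(\underline{\theta}\underline{q} - C(\underline{q})\right) - \left(\nu\bar{U} + (1-\nu)\underline{U}\right)$$

subject to (13.89) to (13.92).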

Following procedures similar to those used so far, only (13.90) and (13.91) are
binding constraints. Finally, we find that the high valuation agent receives the first-best
quality $\bar{q}^{SB} = \bar{q}^*$, where $\bar{\theta} = C'(\bar{q}^*)$. However, quality is now reduced below the first-best
for the low valuation agent. We have $\underline{q}^{SB} < \underline{q}^*$, where

$$\underline{\theta} = C'(\underline{q}^{SB}) + \frac{\nu}{1-\nu}\Delta\theta \quad\text{and}\quad \underline{\theta} = C'(\underline{q}^*). \tag{13.93}$$

Interestingly, the spectrum of qualities is larger under asymmetric information than under
complete information. This incentive of the seller to put a low quality good on the market
is a well-documented phenomenon in the industrial organization literature. Some authors
have even argued that damaging its own goods may be part of the firm's optimal selling
strategy when screening the consumers' willingness to pay for quality is an important
issue.

13.15.4 Financial Contracts

Asymmetric information significantly affects financial markets. For instance, in a
paper by Freixas and Laffont (1990), the principal is a lender who provides a loan of size
$k$ to a borrower. Capital costs $Rk$ to the lender since it could be invested elsewhere in
the economy to earn the risk-free interest rate $R$. The lender thus has a payoff function
$V = t - Rk$. The borrower makes a profit $U = \theta f(k) - t$, where $\theta f(k)$ is the production
with $k$ units of capital and $t$ is the borrower's repayment to the lender. We assume that
$f' > 0$ and $f'' < 0$. The parameter $\theta$ is a productivity shock drawn from $\Theta = \{\underline{\theta}, \bar{\theta}\}$ with
respective probabilities $1 - \nu$ and $\nu$.

Incentive and participation constraints can again be written directly in terms of the
borrower's information rents $\underline{U} = \underline{\theta} f(\underline{k}) - \underline{t}$ and $\bar{U} = \bar{\theta} f(\bar{k}) - \bar{t}$ as



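$$\underline{U} \geq \bar{U} - \Delta\theta f(\bar{k}), \tag{13.94}$$

$$\bar{U} \geq \underline{U} + \Delta\theta f(\underline{k}), \tag{13.95}$$

$$\underline{U} \geq 0, \tag{13.96}$$

$$\bar{U} \geq 0. \tag{13.97}$$

The principal's program now takes the following form:

$$\max_{\{(\bar{U},\bar{k});(\underline{U},\underline{k})\}} \nu\left(\bar{\theta} f(\bar{k}) - R\bar{k}\right) + (1-\nu)\left(\underline{\theta} f(\underline{k}) - R\underline{k}\right) - \left(\nu\bar{U} + (1-\nu)\underline{U}\right)$$

subject to (13.94) to (13.97).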

One can check that (13.95) and (13.96) are now the two binding constraints. As a
result, there is no capital distortion with respect to the first-best outcome for the high
productivity type and $\bar{k}^{SB} = \bar{k}^*$, where $\bar{\theta} f'(\bar{k}^*) = R$. In this case, the return on capital
is equal to the risk-free interest rate. However, there also exists a downward distortion
in the size of the loan given to a low productivity borrower with respect to the first-best
outcome. We have $\underline{k}^{SB} < \underline{k}^*$, where

$$\left(\underline{\theta} - \frac{\nu}{1-\nu}\Delta\theta\right) f'(\underline{k}^{SB}) = R \quad\text{and}\quad \underline{\theta} f'(\underline{k}^*) = R. \tag{13.98}$$

13.15.5 Labor Contracts

Asymmetric information also undermines the relationship between a worker and the firm
for which he works. In Green and Kahn (QJE, 1983) and Hart (RES, 1983), the principal
is a union (or a set of workers) providing its labor force $l$ to a firm.

The firm makes a profit $\theta f(l) - t$, where $f(l)$ is the return on labor and $t$ is the worker's
payment. We assume that $f' > 0$ and $f'' < 0$. The parameter $\theta$ is a productivity shock
drawn from $\Theta = \{\underline{\theta}, \bar{\theta}\}$ with respective probabilities $1 - \nu$ and $\nu$. The firm's objective is to
maximize its profit $U = \theta f(l) - t$. Workers have a utility function defined on consumption
and labor. If their disutility of labor is counted in monetary terms and all revenues from
the firm are consumed, they get $V = v(t - l)$, where $l$ is their disutility of providing $l$ units
of labor and $v(\cdot)$ is increasing and concave ($v' > 0$, $v'' < 0$).

In this context, the firm's boundaries are determined before the realization of the shock
and contracting takes place ex ante. It should be clear that the model is similar to the
one with a risk-averse principal and a risk-neutral agent. So, we know that the risk-averse
union will propose a contract to the risk-neutral firm which provides full insurance and
implements the first-best levels of employment $\bar{l}^*$ and $\underline{l}^*$ defined respectively by
$\bar{\theta} f'(\bar{l}^*) = 1$ and $\underline{\theta} f'(\underline{l}^*) = 1$.

When workers have a utility function exhibiting an income effect, the analysis will 
become much harder even in two-type models. For details, see Laffont and Martimort 
(2002). 

13.16 The Optimal Contract with a Continuum of 
Types 

In this section, we give a brief account of the continuum type case. Most of the principal- 
agent literature is written within this framework. 

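Reconsider the standard model with $\theta$ in $\Theta = [\underline{\theta}, \bar{\theta}]$, with a cumulative distribution
function $F(\theta)$ and a density function $f(\theta) > 0$ on $[\underline{\theta}, \bar{\theta}]$. Since the revelation principle is
still valid with a continuum of types, we can restrict our analysis to direct revelation
mechanisms $\{(q(\tilde{\theta}), t(\tilde{\theta}))\}$, which are truthful, i.e., such that

$$t(\theta) - \theta q(\theta) \geq t(\tilde{\theta}) - \theta q(\tilde{\theta}) \quad\text{for any } (\theta, \tilde{\theta}) \in \Theta^2. \tag{13.99}$$

In particular, (13.99) implies

$$t(\theta) - \theta q(\theta) \geq t(\theta') - \theta q(\theta'), \tag{13.100}$$

$$t(\theta') - \theta' q(\theta') \geq t(\theta) - \theta' q(\theta) \quad\text{for all pairs } (\theta, \theta') \in \Theta^2. \tag{13.101}$$

Adding (13.100) and (13.101) we obtain

$$(\theta - \theta')(q(\theta') - q(\theta)) \geq 0. \tag{13.102}$$

Thus, incentive compatibility alone requires that the schedule of output $q(\cdot)$ be
nonincreasing. Since a monotone function is differentiable almost everywhere, this implies
that $q(\cdot)$ is differentiable almost everywhere. So we will restrict the analysis to
differentiable functions.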

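(13.99) implies that the following first-order condition for the optimal response $\tilde{\theta}$
chosen by type $\theta$ is satisfied:

$$\dot{t}(\tilde{\theta}) - \theta\dot{q}(\tilde{\theta}) = 0. \tag{13.103}$$

For the truth to be an optimal response for all $\theta$, it must be the case that

$$\dot{t}(\theta) - \theta\dot{q}(\theta) = 0, \tag{13.104}$$

and (13.104) must hold for all $\theta$ in $\Theta$ since $\theta$ is unknown to the principal.
It is also necessary to satisfy the local second-order condition,

$$\ddot{t}(\tilde{\theta})\big|_{\tilde{\theta}=\theta} - \theta\ddot{q}(\tilde{\theta})\big|_{\tilde{\theta}=\theta} \leq 0 \tag{13.105}$$

or

$$\ddot{t}(\theta) - \theta\ddot{q}(\theta) \leq 0. \tag{13.106}$$

Differentiating (13.104) gives $\ddot{t}(\theta) - \theta\ddot{q}(\theta) - \dot{q}(\theta) = 0$, so (13.106) can be written
more simply as

$$-\dot{q}(\theta) \geq 0. \tag{13.107}$$

(13.104) and (13.107) constitute the local incentive constraints, which ensure that the
agent does not want to lie locally. Now we need to check that he does not want to lie
globally either; therefore the following constraints must be satisfied:

$$t(\theta) - \theta q(\theta) \geq t(\tilde{\theta}) - \theta q(\tilde{\theta}) \quad\text{for any } (\theta, \tilde{\theta}) \in \Theta^2. \tag{13.108}$$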

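From (13.104) we have

$$t(\theta) - t(\tilde{\theta}) = \int_{\tilde{\theta}}^{\theta} \tau\dot{q}(\tau)\,d\tau = \theta q(\theta) - \tilde{\theta} q(\tilde{\theta}) - \int_{\tilde{\theta}}^{\theta} q(\tau)\,d\tau \tag{13.109}$$

or

$$t(\theta) - \theta q(\theta) = t(\tilde{\theta}) - \theta q(\tilde{\theta}) + (\theta - \tilde{\theta})q(\tilde{\theta}) - \int_{\tilde{\theta}}^{\theta} q(\tau)\,d\tau, \tag{13.110}$$

where $(\theta - \tilde{\theta})q(\tilde{\theta}) - \int_{\tilde{\theta}}^{\theta} q(\tau)\,d\tau \geq 0$, because $q(\cdot)$ is nonincreasing.

So, it turns out that the local incentive constraints (13.104), together with the
monotonicity condition (13.107), also imply the global incentive constraints.

In such circumstances, the infinity of incentive constraints (13.108) reduces to a
differential equation and to a monotonicity constraint. Local analysis of incentives is enough.
Truthful revelation mechanisms are then characterized by the two conditions (13.104) and
(13.107).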
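
The role of the monotonicity condition can be checked by brute force on a grid of types. The sketch below is only an illustration: the type space $[1, 2]$, the two output schedules, and the helper name `truthful_on_grid` are assumptions made here. Both schedules satisfy the differential equation (13.104), but only the nonincreasing one satisfies (13.107), and only that one passes the global check (13.108).

```python
def truthful_on_grid(q, t, types, tol=1e-9):
    """Brute-force check of the global incentive constraints (13.108) on a grid."""
    for theta in types:
        truthful_payoff = t(theta) - theta * q(theta)
        for report in types:
            if t(report) - theta * q(report) > truthful_payoff + tol:
                return False
    return True


# Grid on the ASSUMED type space [1, 2].
types = [1.0 + i / 100.0 for i in range(101)]

# Assumed nonincreasing schedule q(theta) = 3 - theta. Integrating (13.104),
# dt/dtheta = theta * dq/dtheta = -theta, gives t(theta) = 4 - theta**2 / 2
# (the constant 4 makes the rent of type 2 equal to zero).
decreasing_ok = truthful_on_grid(lambda th: 3.0 - th,
                                 lambda th: 4.0 - th ** 2 / 2.0, types)

# Assumed increasing schedule q(theta) = theta. (13.104) gives t = theta**2 / 2,
# but (13.107) fails, so truth-telling is a local minimum of the agent's payoff.
increasing_ok = truthful_on_grid(lambda th: th,
                                 lambda th: th ** 2 / 2.0, types)

print(decreasing_ok, increasing_ok)
```

For the decreasing schedule the gain from truth-telling over any report $r$ is $(\theta - r)^2/2 \geq 0$, while for the increasing schedule the sign is reversed.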

Let us use the rent variable $U(\theta) = t(\theta) - \theta q(\theta)$. The local incentive constraint is now
written as (by using (13.104))

$$\dot{U}(\theta) = -q(\theta). \tag{13.111}$$

The optimization program of the principal becomes

$$\max_{\{(U(\cdot),q(\cdot))\}} \int_{\underline{\theta}}^{\bar{\theta}} \left(S(q(\theta)) - \theta q(\theta) - U(\theta)\right) f(\theta)\,d\theta \tag{13.112}$$

subject to

$$\dot{U}(\theta) = -q(\theta), \tag{13.113}$$

$$\dot{q}(\theta) \leq 0, \tag{13.114}$$

$$U(\theta) \geq 0. \tag{13.115}$$

Using (13.113), the participation constraint (13.115) simplifies to $U(\bar{\theta}) \geq 0$. As in the
discrete case, incentive compatibility implies that only the participation constraint of the
most inefficient type can be binding. Furthermore, it is clear from the above program
that it will be binding, i.e., $U(\bar{\theta}) = 0$.

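Momentarily ignoring (13.114), we can solve (13.113):

$$U(\bar{\theta}) - U(\theta) = -\int_{\theta}^{\bar{\theta}} q(\tau)\,d\tau \tag{13.116}$$

or, since $U(\bar{\theta}) = 0$,

$$U(\theta) = \int_{\theta}^{\bar{\theta}} q(\tau)\,d\tau. \tag{13.117}$$

The principal's objective function becomes

$$\int_{\underline{\theta}}^{\bar{\theta}} \left(S(q(\theta)) - \theta q(\theta) - \int_{\theta}^{\bar{\theta}} q(\tau)\,d\tau\right) f(\theta)\,d\theta, \tag{13.118}$$

which, by an integration by parts, gives

$$\int_{\underline{\theta}}^{\bar{\theta}} \left(S(q(\theta)) - \left(\theta + \frac{F(\theta)}{f(\theta)}\right) q(\theta)\right) f(\theta)\,d\theta. \tag{13.119}$$

Maximizing (13.119) pointwise, we get the second-best optimal outputs

$$S'(q^{SB}(\theta)) = \theta + \frac{F(\theta)}{f(\theta)}, \tag{13.120}$$

which is the first-order condition for the case of a continuum of types.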

If the monotone hazard rate property $\frac{d}{d\theta}\left(\frac{F(\theta)}{f(\theta)}\right) \geq 0$ holds, the solution $q^{SB}(\theta)$ of
(13.120) is clearly decreasing, and the neglected constraint (13.114) is satisfied. All types
therefore choose different allocations and there is no bunching in the optimal contract.
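
A small worked example may help fix ideas. The sketch below assumes types uniformly distributed on $[1, 2]$, so that $F(\theta)/f(\theta) = \theta - 1$ and the monotone hazard rate property holds, and it assumes $S(q) = 2\sqrt{q}$. Both choices, as well as the function names `q_sb` and `rent`, are illustrative assumptions. Under them (13.120) gives $q^{SB}(\theta) = 1/(2\theta - 1)^2$, and the rent in (13.117) is evaluated with a simple midpoint rule.

```python
def q_sb(theta, theta_min=1.0):
    """Second-best output from (13.120) for uniform types and S(q) = 2*sqrt(q).

    Uniform types on [theta_min, theta_max] give F/f = theta - theta_min, and
    S'(q) = q**-0.5, so q = 1 / (2*theta - theta_min)**2. Both are assumptions.
    """
    return 1.0 / (2.0 * theta - theta_min) ** 2


def rent(theta, theta_max=2.0, theta_min=1.0, steps=10_000):
    """Information rent (13.117): integral of q_sb from theta to theta_max."""
    width = (theta_max - theta) / steps
    total = sum(q_sb(theta + (i + 0.5) * width, theta_min) for i in range(steps))
    return total * width


for theta in (1.0, 1.5, 2.0):
    first_best = 1.0 / theta ** 2  # solves S'(q) = theta
    print(f"theta={theta}: q_SB={q_sb(theta):.4f}, "
          f"first best={first_best:.4f}, rent={rent(theta):.4f}")
```

In this assumed example the most efficient type produces the first-best output, every other type produces less than the first best, and the rent falls to zero for the least efficient type.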

From (13.120), we note that there is no distortion for the most efficient type (since
$F(\underline{\theta}) = 0$) and a downward distortion for all the other types.

All types, except the least efficient one, obtain a positive information rent at the
optimal contract

$$U^{SB}(\theta) = \int_{\theta}^{\bar{\theta}} q^{SB}(\tau)\,d\tau. \tag{13.121}$$

Finally, one could also allow for some shutdown of types. The virtual surplus
$S(q) - \left(\theta + \frac{F(\theta)}{f(\theta)}\right) q$ decreases with $\theta$ when the monotone hazard rate property holds, and
shutdown (if any) occurs on an interval $[\theta^*, \bar{\theta}]$. $\theta^*$ is obtained as a solution to

$$\max_{\{\theta^*\}} \int_{\underline{\theta}}^{\theta^*} \left(S(q^{SB}(\theta)) - \left(\theta + \frac{F(\theta)}{f(\theta)}\right) q^{SB}(\theta)\right) f(\theta)\,d\theta.$$

For an interior optimum, we find that

$$S(q^{SB}(\theta^*)) = \left(\theta^* + \frac{F(\theta^*)}{f(\theta^*)}\right) q^{SB}(\theta^*).$$

As in the discrete case, one can check that the Inada condition $S'(0) = +\infty$ and the
condition $\lim_{q \to 0} S'(q)q = 0$ ensure the corner solution $\theta^* = \bar{\theta}$.

Remark 13.16.1 The optimal solution above can also be derived by using the Pontryagin
principle. The Hamiltonian is then

$$H(q, U, \mu, \theta) = (S(q) - \theta q - U) f(\theta) - \mu q, \tag{13.122}$$

where $\mu$ is the co-state variable, $U$ the state variable and $q$ the control variable.
From the Pontryagin principle,

$$\dot{\mu}(\theta) = -\frac{\partial H}{\partial U} = f(\theta). \tag{13.123}$$

From the transversality condition (since there is no constraint on $U(\cdot)$ at $\underline{\theta}$),

$$\mu(\underline{\theta}) = 0. \tag{13.124}$$

Integrating (13.123) using (13.124), we get

$$\mu(\theta) = F(\theta). \tag{13.125}$$

Optimizing with respect to $q(\cdot)$ also yields

$$S'(q^{SB}(\theta)) = \theta + \frac{\mu(\theta)}{f(\theta)}, \tag{13.126}$$

and inserting the value of $\mu(\theta)$ obtained from (13.125) again yields (13.120).

We have derived the optimal truthful direct revelation mechanism $\{(q^{SB}(\theta), U^{SB}(\theta))\}$
or $\{(q^{SB}(\theta), t^{SB}(\theta))\}$. It remains to be investigated whether there is a simple implementation of
this mechanism. Since $q^{SB}(\cdot)$ is decreasing, we can invert this function and obtain $\theta^{SB}(q)$.
Then,

$$t^{SB}(\theta) = U^{SB}(\theta) + \theta q^{SB}(\theta) \tag{13.127}$$

becomes

$$T(q) = t^{SB}(\theta^{SB}(q)) = \int_{\theta(q)}^{\bar{\theta}} q^{SB}(\tau)\,d\tau + \theta(q) q. \tag{13.128}$$

To the optimal truthful direct revelation mechanism we have associated a nonlinear
transfer $T(q)$. We can check that the agent confronted with this nonlinear transfer chooses
the same allocation as when he is faced with the optimal revelation mechanism. Indeed,
we have $\frac{d}{dq}(T(q) - \theta q) = T'(q) - \theta = \frac{dt^{SB}}{d\theta} \cdot \frac{d\theta^{SB}}{dq} - \theta = 0$, since $\frac{dt^{SB}}{d\theta} - \theta\frac{dq^{SB}}{d\theta} = 0$.
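
The implementation claim can be checked numerically in a simple case. The sketch below again assumes uniform types on $[1, 2]$ and $S(q) = 2\sqrt{q}$, so that $q^{SB}(\theta) = 1/(2\theta - 1)^2$. These assumptions and the names `tariff` and `best_output` are illustrative only. It builds $T(q)$ from (13.128) in closed form and lets a type-$\theta$ agent pick the output that maximizes $T(q) - \theta q$ on a grid.

```python
import math

THETA_MIN, THETA_MAX = 1.0, 2.0  # ASSUMED support of the uniform type distribution


def tariff(q):
    """Nonlinear transfer T(q) from (13.128) in the assumed example.

    With q_SB(theta) = 1 / (2*theta - THETA_MIN)**2, inverting gives
    theta(q) = (THETA_MIN + q**-0.5) / 2, and the rent integral equals
    sqrt(q)/2 - 1 / (2 * (2*THETA_MAX - THETA_MIN)).
    """
    theta_of_q = (THETA_MIN + 1.0 / math.sqrt(q)) / 2.0
    rent = math.sqrt(q) / 2.0 - 1.0 / (2.0 * (2.0 * THETA_MAX - THETA_MIN))
    return rent + theta_of_q * q


def best_output(theta, points=9001):
    """Grid search for the output maximizing T(q) - theta*q over the offered range."""
    q_lo = 1.0 / (2.0 * THETA_MAX - THETA_MIN) ** 2  # output of the least efficient type
    q_hi = 1.0 / (2.0 * THETA_MIN - THETA_MIN) ** 2  # output of the most efficient type
    grid = [q_lo + (q_hi - q_lo) * i / (points - 1) for i in range(points)]
    return max(grid, key=lambda q: tariff(q) - theta * q)


theta = 1.5
print(best_output(theta), 1.0 / (2.0 * theta - THETA_MIN) ** 2)
```

Up to the grid resolution, the output chosen under the tariff coincides with $q^{SB}(\theta)$ in this assumed example.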

To conclude, the economic insights obtained in the continuum case are not different 
from those obtained in the two-state case. 



13.17 Further Extensions 

The main theme of this chapter was to determine how the fundamental conflict between 
rent extraction and efficiency could be solved in a principal-agent relationship with ad- 
verse selection. In the models discussed, this conflict was relatively easy to understand 
because it resulted from the simple interaction of a single incentive constraint with a 
single participation constraint. Here we mention some possible extensions.

One can consider a straightforward three-type extension of the standard model. One
can also deal with a bidimensional adverse selection model, a two-type model with
type-dependent reservation utilities, random participation constraints, limited liability
constraints, and audit models. For a detailed discussion of these topics and their
applications, see Laffont and Martimort (2002).

Chapter 14

Optimal Mechanism Design:
Contracts with One-Agent and
Hidden Action

14.1 Introduction 

In the previous chapter, we stressed that the delegation of tasks creates an information 
gap between the principal and his agent when the latter learns some piece of information 
relevant to determining the efficient volume of trade. Adverse selection is not the only 
informational problem one can imagine. Agents may also choose actions that affect the 
value of trade or, more generally, the agent's performance. The principal often loses any 
ability to control those actions that are no longer observable, either by the principal who 
offers the contract or by the court of law that enforces it. In such cases we will say that 
there is moral hazard. 

The leading candidates for such moral hazard actions are effort variables, which pos- 
itively influence the agent's level of production but also create a disutility for the agent. 
For instance the yield of a field depends on the amount of time that the tenant has spent 
selecting the best crops, or the quality of their harvesting. Similarly, the probability that 
a driver has a car crash depends on how safely he drives, which also affects his demand 
for insurance. Also, a regulated firm may have to perform a costly and nonobservable 
investment to reduce its cost of producing a socially valuable good.

As in the case of adverse selection, asymmetric information also plays a crucial role
in the design of the optimal incentive contract under moral hazard. However, instead of 
being an exogenous uncertainty for the principal, uncertainty is now endogenous. The 
probabilities of the different states of nature, and thus the expected volume of trade, now 
depend explicitly on the agent's effort. In other words, the realized production level is 
only a noisy signal of the agent's action. This uncertainty is key to understanding the 
contractual problem under moral hazard. If the mapping between effort and performance 
were completely deterministic, the principal and the court of law would have no difficulty 
in inferring the agent's effort from the observed output. Even if the agent's effort was not 
observable directly, it could be indirectly contracted upon, since output would itself be 
observable and verifiable. 

We will study the properties of incentive schemes that induce a positive and costly 
effort. Such schemes must thus satisfy an incentive constraint and the agent's participa- 
tion constraint. Among such schemes, the principal prefers the one that implements the 
positive level of effort at minimal cost. This cost minimization yields the characterization 
of the second-best cost of implementing this effort. In general, this second-best cost is 
greater than the first-best cost that would be obtained by assuming that effort is observ- 
able. An allocative inefficiency emerges as the result of the conflict of interests between 
the principal and the agent. 

14.2 Basic Settings of Principal-Agent Model with
Moral Hazard

14.2.1 Effort and Production

We consider an agent who can exert a costly effort $e$. Two possible values can be taken by
$e$, which we normalize as a zero effort level and a positive effort of one: $e$ in $\{0, 1\}$. Exerting
effort $e$ implies a disutility for the agent that is equal to $\psi(e)$ with the normalization
$\psi(0) = \psi_0 = 0$ and $\psi_1 = \psi$.

The agent receives a transfer $t$ from the principal. We assume that his utility function
is separable between money and effort, $U = u(t) - \psi(e)$, with $u(\cdot)$ increasing and concave
($u' > 0$, $u'' < 0$). Sometimes we will use the function $h = u^{-1}$, the inverse function of $u(\cdot)$,
which is increasing and convex ($h' > 0$, $h'' > 0$).

Production is stochastic, and effort affects the production level as follows: the
stochastic production level $\tilde{q}$ can only take two values $\{\underline{q}, \bar{q}\}$, with $\bar{q} - \underline{q} = \Delta q > 0$,
and the stochastic influence of effort on production is characterized by the probabilities
$\Pr(\tilde{q} = \bar{q} \mid e = 0) = \pi_0$ and $\Pr(\tilde{q} = \bar{q} \mid e = 1) = \pi_1$, with $\pi_1 > \pi_0$. We will denote the
difference between these two probabilities by $\Delta\pi = \pi_1 - \pi_0$.

Note that effort improves production in the sense of first-order stochastic dominance,
i.e., $\Pr(\tilde{q} \leq q^* \mid e)$ is weakly decreasing with $e$ for any given production $q^*$. Indeed, we have
$\Pr(\tilde{q} \leq \underline{q} \mid e = 1) = 1 - \pi_1 < 1 - \pi_0 = \Pr(\tilde{q} \leq \underline{q} \mid e = 0)$ and
$\Pr(\tilde{q} \leq \bar{q} \mid e = 1) = 1 = \Pr(\tilde{q} \leq \bar{q} \mid e = 0)$.

14.2.2 Incentive Feasible Contracts 

Since the agent's action is not directly observable by the principal, the principal can only
offer a contract based on the observable and verifiable production level, i.e., a function
$\{t(\tilde{q})\}$ linking the agent's compensation to the random output $\tilde{q}$. With two possible
outcomes $\bar{q}$ and $\underline{q}$, the contract can be defined equivalently by a pair of transfers $\bar{t}$ and $\underline{t}$.
Transfer $\bar{t}$ (resp. $\underline{t}$) is the payment received by the agent if the production $\bar{q}$ (resp. $\underline{q}$) is
realized.

The risk-neutral principal's expected utility is now written as

$$V_1 = \pi_1(S(\bar{q}) - \bar{t}) + (1 - \pi_1)(S(\underline{q}) - \underline{t}) \tag{14.1}$$

if the agent makes a positive effort ($e = 1$) and

$$V_0 = \pi_0(S(\bar{q}) - \bar{t}) + (1 - \pi_0)(S(\underline{q}) - \underline{t}) \tag{14.2}$$

if the agent makes no effort ($e = 0$). For notational simplicity, we will denote the
principal's benefits in each state of nature by $S(\bar{q}) = \bar{S}$ and $S(\underline{q}) = \underline{S}$.

Each level of effort that the principal wishes to induce corresponds to a set of contracts
ensuring that the moral hazard incentive compatibility constraint and the participation
constraint are satisfied:

$$\pi_1 u(\bar{t}) + (1 - \pi_1)u(\underline{t}) - \psi \geq \pi_0 u(\bar{t}) + (1 - \pi_0)u(\underline{t}), \tag{14.3}$$

$$\pi_1 u(\bar{t}) + (1 - \pi_1)u(\underline{t}) - \psi \geq 0. \tag{14.4}$$

Note that the participation constraint is ensured at the ex ante stage, i.e., before the
realization of the production shock.

Definition 14.2.1 An incentive feasible contract satisfies the incentive compatibility and
participation constraints (14.3) and (14.4).
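
The definition translates directly into a check that can be run on any candidate pair of transfers. In the sketch below, the utility function $u(t) = 1 - e^{-t}$, the parameter values, and the function name `is_incentive_feasible` are assumptions chosen for illustration and are not taken from the text.

```python
import math


def is_incentive_feasible(t_high, t_low, pi0, pi1, psi, u):
    """Check the incentive constraint (14.3) and participation constraint (14.4)."""
    utility_effort = pi1 * u(t_high) + (1.0 - pi1) * u(t_low) - psi
    utility_shirk = pi0 * u(t_high) + (1.0 - pi0) * u(t_low)
    incentive_ok = utility_effort >= utility_shirk   # (14.3)
    participation_ok = utility_effort >= 0.0         # (14.4)
    return incentive_ok and participation_ok


def u(t):
    """ASSUMED increasing and concave utility of money."""
    return 1.0 - math.exp(-t)


# Assumed primitives: pi0 = 0.2, pi1 = 0.8, psi = 0.3.
flat_wage = is_incentive_feasible(1.0, 1.0, 0.2, 0.8, 0.3, u)
steep_bonus = is_incentive_feasible(2.0, 0.0, 0.2, 0.8, 0.3, u)
print(flat_wage, steep_bonus)
```

A flat wage fails the incentive constraint because effort is costly and does not change the payment, while a sufficiently steep bonus satisfies both constraints in this assumed example.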

The timing of the contracting game under moral hazard is summarized in the figure
below.

[Timeline: at $t = 0$, P offers a contract $\{(\bar{t}, \underline{t})\}$; at $t = 1$, A accepts or refuses the
contract; at $t = 2$, A exerts an effort or not; at $t = 3$, the outcome $\tilde{q}$ is realized; at
$t = 4$, the contract is executed.]

Figure 14.1: Timing of contracting under moral hazard.



14.2.3 The Complete Information Optimal Contract 

As a benchmark, let us first assume that the principal and a benevolent court of law can
both observe effort. Then, if he wants to induce effort, the principal's problem becomes

$$\max_{\{(\bar{t},\underline{t})\}} \pi_1(\bar{S} - \bar{t}) + (1 - \pi_1)(\underline{S} - \underline{t}) \tag{14.5}$$

subject to (14.4).

Indeed, only the agent's participation constraint matters for the principal, because the
agent can be forced to exert a positive level of effort. If the agent were not choosing this
level of effort, the agent could be heavily punished, and the court of law could commit to
enforce such a punishment.

Denoting the multiplier of this participation constraint by $\lambda$ and optimizing with
respect to $\bar{t}$ and $\underline{t}$ yields, respectively, the following first-order conditions:

$$-\pi_1 + \lambda\pi_1 u'(\bar{t}^*) = 0, \tag{14.6}$$

$$-(1 - \pi_1) + \lambda(1 - \pi_1)u'(\underline{t}^*) = 0, \tag{14.7}$$

where $\bar{t}^*$ and $\underline{t}^*$ are the first-best transfers.

From (14.6) and (14.7) we immediately derive that $\lambda = \frac{1}{u'(\underline{t}^*)} = \frac{1}{u'(\bar{t}^*)} > 0$, and finally
that $t^* = \bar{t}^* = \underline{t}^*$.

Thus, with a verifiable effort, the agent obtains full insurance from the risk-neutral
principal, and the transfer $t^*$ he receives is the same whatever the state of nature. Because
the participation constraint is binding we also obtain the value of this transfer, which is
just enough to cover the disutility of effort, namely $t^* = h(\psi)$. This is also the expected
payment made by the principal to the agent, or the first-best cost $C^{FB}$ of implementing
the positive effort level.

For the principal, inducing effort yields an expected payoff equal to

$$V_1 = \pi_1\bar{S} + (1 - \pi_1)\underline{S} - h(\psi). \tag{14.8}$$

Had the principal decided to let the agent exert no effort, $e = 0$, he would make a zero
payment to the agent whatever the realization of output. In this scenario, the principal
would instead obtain a payoff equal to

$$V_0 = \pi_0\bar{S} + (1 - \pi_0)\underline{S}. \tag{14.9}$$

Inducing effort is thus optimal from the principal's point of view when $V_1 \geq V_0$, i.e.,
$\pi_1\bar{S} + (1 - \pi_1)\underline{S} - h(\psi) \geq \pi_0\bar{S} + (1 - \pi_0)\underline{S}$, or to put it differently, when the expected
gain of effort is greater than the first-best cost of inducing effort, i.e.,

$$\Delta\pi\Delta S \geq h(\psi), \tag{14.10}$$

where $\Delta S = \bar{S} - \underline{S} > 0$.

Denoting the benefit of inducing a strictly positive effort level by $B = \Delta\pi\Delta S$, the
first-best outcome calls for $e^* = 1$ if and only if $B > h(\psi)$, as shown in the figure below.

[Figure: first-best effort $e^*$ plotted against the benefit $B$.]

Figure 14.2: First-best level of effort.

14.3 Risk Neutrality and First-Best Implementation

If the agent is risk-neutral, we have (up to an affine transformation) $u(t) = t$ for all $t$
and $h(u) = u$ for all $u$. The principal who wants to induce effort must thus choose the
contract that solves the following problem:

$$\max_{\{(\bar{t},\underline{t})\}} \pi_1(\bar{S} - \bar{t}) + (1 - \pi_1)(\underline{S} - \underline{t})$$

subject to

$$\pi_1\bar{t} + (1 - \pi_1)\underline{t} - \psi \geq \pi_0\bar{t} + (1 - \pi_0)\underline{t}, \tag{14.11}$$

$$\pi_1\bar{t} + (1 - \pi_1)\underline{t} - \psi \geq 0. \tag{14.12}$$

With risk neutrality the principal can, for instance, choose incentive compatible
transfers $\bar{t}$ and $\underline{t}$, which make the agent's participation constraint binding and leave no rent to
the agent. Indeed, solving (14.11) and (14.12) with equalities, we immediately obtain

$$\underline{t}^* = -\frac{\pi_0}{\Delta\pi}\psi \tag{14.13}$$

and

$$\bar{t}^* = \frac{1 - \pi_0}{\Delta\pi}\psi = \underline{t}^* + \frac{\psi}{\Delta\pi}. \tag{14.14}$$

The agent is rewarded if production is high. His net utility in this state of nature is
$\bar{U}^* = \bar{t}^* - \psi = \frac{1 - \pi_1}{\Delta\pi}\psi > 0$. Conversely, the agent is punished if production is low. His
corresponding net utility is $\underline{U}^* = \underline{t}^* - \psi = -\frac{\pi_1}{\Delta\pi}\psi < 0$.

The principal makes an expected payment $\pi_1\bar{t}^* + (1 - \pi_1)\underline{t}^* = \psi$, which is equal to
the disutility of effort he would incur if he could control the effort level perfectly. The
principal can costlessly structure the agent's payment so that the latter has the right
incentives to exert effort. Using (14.13) and (14.14), the agent's expected gain in transfers
from increasing his effort from $e = 0$ to $e = 1$ is $\Delta\pi(\bar{t}^* - \underline{t}^*) = \psi$, which exactly offsets
his disutility of effort.
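
The following sketch evaluates (14.13) and (14.14) for one set of illustrative parameter values and confirms that the expected payment equals $\psi$. The numbers and the function name `risk_neutral_transfers` are assumptions made for this example.

```python
def risk_neutral_transfers(pi0, pi1, psi):
    """Transfers (14.13)-(14.14) that make (14.11) and (14.12) bind."""
    delta_pi = pi1 - pi0
    t_low = -pi0 / delta_pi * psi           # punishment after low output
    t_high = (1.0 - pi0) / delta_pi * psi   # reward after high output
    return t_high, t_low


pi0, pi1, psi = 0.25, 0.75, 1.0   # illustrative values (assumptions)
t_high, t_low = risk_neutral_transfers(pi0, pi1, psi)
expected_payment = pi1 * t_high + (1.0 - pi1) * t_low
incentive_gain = (pi1 - pi0) * (t_high - t_low)
print(t_high, t_low, expected_payment, incentive_gain)
```

With these assumed values the agent receives 1.5 after high output and pays 0.5 after low output, so the principal's expected payment is exactly the disutility of effort.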

Proposition 14.3.1 Moral hazard is not an issue with a risk-neutral agent despite the
nonobservability of effort. The first-best level of effort is still implemented.

Remark 14.3.1 One may note the similarity between these results and those described in
the previous chapter. In both cases, when contracting takes place ex ante, the incentive
constraint, under either adverse selection or moral hazard, does not conflict with the ex ante
participation constraint with a risk-neutral agent, and the first-best outcome is still implemented.

Remark 14.3.2 Inefficiencies in effort provision due to moral hazard will arise when the 
agent is no longer risk-neutral. There are two alternative ways to model these transaction 
costs. One is to maintain risk neutrality for positive income levels but to impose a limited 
liability constraint, which requires transfers not to be too negative. The other is to 
let the agent be strictly risk-averse. In the following, we analyze these two contractual 
environments and the different trade-offs they imply. 



14.4 The Trade-Off Between Limited Liability Rent 
Extraction and Efficiency 

Let us consider a risk-neutral agent. As we have already seen, (14.3) and (14.4) now take
the following forms:

$$\pi_1\bar{t} + (1 - \pi_1)\underline{t} - \psi \geq \pi_0\bar{t} + (1 - \pi_0)\underline{t} \tag{14.15}$$

and

$$\pi_1\bar{t} + (1 - \pi_1)\underline{t} - \psi \geq 0. \tag{14.16}$$

Let us also assume that the agent's transfer must always be greater than some
exogenous level $-l$, with $l \geq 0$. Thus, limited liability constraints in both states of nature are
written as

$$\bar{t} \geq -l \tag{14.17}$$

and

$$\underline{t} \geq -l. \tag{14.18}$$

These constraints may prevent the principal from implementing the first-best level of
effort even if the agent is risk-neutral. Indeed, when he wants to induce a high effort, the
principal's program is written as

$$\max_{\{(\bar{t},\underline{t})\}} \pi_1(\bar{S} - \bar{t}) + (1 - \pi_1)(\underline{S} - \underline{t}) \tag{14.19}$$

subject to (14.15) to (14.18).

Then, we have the following proposition.

Proposition 14.4.1 With limited liability, the optimal contract inducing effort from the
agent entails:

(1) For $l > \frac{\pi_0}{\Delta\pi}\psi$, only (14.15) and (14.16) are binding. Optimal transfers are
given by (14.13) and (14.14). The agent has no expected limited liability
rent; $EU^{SB} = 0$.

(2) For $0 \leq l \leq \frac{\pi_0}{\Delta\pi}\psi$, (14.15) and (14.18) are binding. Optimal transfers are
then given by:

$$\underline{t}^{SB} = -l, \tag{14.20}$$

$$\bar{t}^{SB} = -l + \frac{\psi}{\Delta\pi}. \tag{14.21}$$

(3) Moreover, the agent's expected limited liability rent $EU^{SB}$ is non-negative:

$$EU^{SB} = \pi_1\bar{t}^{SB} + (1 - \pi_1)\underline{t}^{SB} - \psi = -l + \frac{\pi_0}{\Delta\pi}\psi \geq 0. \tag{14.22}$$

Proof. First suppose that $0 \leq l \leq \frac{\pi_0}{\Delta\pi}\psi$. We conjecture that (14.15) and (14.18) are
the only relevant constraints. Of course, since the principal wants to minimize the
payments made to the agent, both constraints must be binding. Hence, $\underline{t}^{SB} = -l$ and
$\bar{t}^{SB} = -l + \frac{\psi}{\Delta\pi}$. We check that (14.17) is satisfied since $-l + \frac{\psi}{\Delta\pi} > -l$. We also check that
(14.16) is satisfied since $\pi_1\bar{t}^{SB} + (1 - \pi_1)\underline{t}^{SB} - \psi = -l + \frac{\pi_0}{\Delta\pi}\psi \geq 0$.

For $l > \frac{\pi_0}{\Delta\pi}\psi$, note that the transfers $\underline{t}^* = -\frac{\pi_0}{\Delta\pi}\psi$ and $\bar{t}^* = \frac{1 - \pi_0}{\Delta\pi}\psi > \underline{t}^*$ are
such that both limited liability constraints (14.17) and (14.18) are strictly satisfied, and
(14.15) and (14.16) are both binding. In this case, it is costless to induce a positive effort
by the agent, and the first-best outcome can be implemented. The proof is completed.

Note that only the limited liability constraint in the bad state of nature may be
binding. When the limited liability constraint (14.18) is binding, the principal is limited
in his punishments to induce effort. The risk-neutral agent does not have enough assets
to cover the punishment if $\underline{q}$ is realized in order to induce effort provision. The principal
uses rewards when a good state of nature $\bar{q}$ is realized. As a result, the agent receives a
non-negative ex ante limited liability rent described by (14.22). Compared with the case
without limited liability, this rent is actually the additional payment that the principal
must incur because of the conjunction of moral hazard and limited liability.

As the agent becomes endowed with more assets, i.e., as $l$ gets larger, the conflict
between moral hazard and limited liability diminishes and then disappears whenever $l$ is
large enough.
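
The two regimes of Proposition 14.4.1 can be sketched as a small function. The parameter values and the name `limited_liability_contract` are illustrative assumptions. The code simply applies (14.13), (14.14), (14.20) and (14.21) and then evaluates the expected rent.

```python
def limited_liability_contract(pi0, pi1, psi, l):
    """Effort-inducing contract of Proposition 14.4.1 for a risk-neutral agent.

    Returns (t_high, t_low, expected_rent); l >= 0 is the liability limit.
    """
    delta_pi = pi1 - pi0
    threshold = pi0 / delta_pi * psi
    if l > threshold:
        # Case (1): limited liability is slack, transfers (14.13)-(14.14) apply.
        t_low = -threshold
        t_high = (1.0 - pi0) / delta_pi * psi
    else:
        # Case (2): (14.15) and (14.18) bind, giving (14.20)-(14.21).
        t_low = -l
        t_high = -l + psi / delta_pi
    expected_rent = pi1 * t_high + (1.0 - pi1) * t_low - psi
    return t_high, t_low, expected_rent


# Illustrative values (assumptions): pi0 = 0.25, pi1 = 0.75, psi = 1.
for l in (0.0, 0.25, 1.0):
    print(l, limited_liability_contract(0.25, 0.75, 1.0, l))
```

In this assumed example the threshold $\frac{\pi_0}{\Delta\pi}\psi$ equals 0.5, and the expected rent falls from 0.5 to 0.25 to 0 as $l$ rises from 0 to 0.25 to 1.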



14.5 The Trade-Off Between Insurance and Efficiency

Now suppose the agent is risk-averse. The principal's program is written as:

$$\max_{\{(\bar{t},\underline{t})\}} \pi_1(\bar{S} - \bar{t}) + (1 - \pi_1)(\underline{S} - \underline{t}) \tag{14.23}$$

subject to (14.3) and (14.4).

Since the principal's optimization problem may not be a concave program for which
the first-order Kuhn-Tucker conditions are necessary and sufficient, we make the
following change of variables. Define $\bar{u} = u(\bar{t})$ and $\underline{u} = u(\underline{t})$, or equivalently let $\bar{t} = h(\bar{u})$
and $\underline{t} = h(\underline{u})$. These new variables are the levels of ex post utility obtained by the agent
in both states of nature. The set of incentive feasible contracts can now be described by
two linear constraints:

$$\pi_1\bar{u} + (1 - \pi_1)\underline{u} - \psi \geq \pi_0\bar{u} + (1 - \pi_0)\underline{u}, \tag{14.24}$$

$$\pi_1\bar{u} + (1 - \pi_1)\underline{u} - \psi \geq 0, \tag{14.25}$$

which replace (14.3) and (14.4), respectively.

Then, the principal's program can be rewritten as

$$\max_{\{(\bar{u},\underline{u})\}} \pi_1(\bar{S} - h(\bar{u})) + (1 - \pi_1)(\underline{S} - h(\underline{u})) \tag{14.26}$$

subject to (14.24) and (14.25).

Note that the principal's objective function is now strictly concave in $(\bar{u}, \underline{u})$ because
$h(\cdot)$ is strictly convex. The constraints are now linear and the interior of the constrained
set is obviously non-empty.

14.5.1 Optimal Transfers

Letting $\lambda$ and $\mu$ be the non-negative multipliers associated respectively with the
constraints (14.24) and (14.25), the first-order conditions of this program can be expressed
as

$$-\pi_1 h'(\bar{u}^{SB}) + \lambda\Delta\pi + \mu\pi_1 = -\frac{\pi_1}{u'(\bar{t}^{SB})} + \lambda\Delta\pi + \mu\pi_1 = 0, \tag{14.27}$$

$$-(1 - \pi_1)h'(\underline{u}^{SB}) - \lambda\Delta\pi + \mu(1 - \pi_1) = -\frac{1 - \pi_1}{u'(\underline{t}^{SB})} - \lambda\Delta\pi + \mu(1 - \pi_1) = 0, \tag{14.28}$$

where $\bar{t}^{SB}$ and $\underline{t}^{SB}$ are the second-best optimal transfers. Rearranging terms, we get

$$\frac{1}{u'(\bar{t}^{SB})} = \mu + \lambda\frac{\Delta\pi}{\pi_1}, \tag{14.29}$$

$$\frac{1}{u'(\underline{t}^{SB})} = \mu - \lambda\frac{\Delta\pi}{1 - \pi_1}. \tag{14.30}$$

The four variables $(\underline{t}^{SB}, \bar{t}^{SB}, \lambda, \mu)$ are simultaneously obtained as the solutions to the
system of four equations (14.24), (14.25), (14.29), and (14.30). Multiplying (14.29) by $\pi_1$
and (14.30) by $1 - \pi_1$, and then adding those two modified equations we obtain

$$\mu = \frac{\pi_1}{u'(\bar{t}^{SB})} + \frac{1 - \pi_1}{u'(\underline{t}^{SB})} > 0. \tag{14.31}$$

Hence, the participation constraint (14.25) is necessarily binding. Using (14.31) and
(14.29), we also obtain

$$\lambda = \frac{\pi_1(1 - \pi_1)}{\Delta\pi}\left(\frac{1}{u'(\bar{t}^{SB})} - \frac{1}{u'(\underline{t}^{SB})}\right), \tag{14.32}$$

where $\lambda$ must also be strictly positive. Indeed, from (14.24) we have $\bar{u}^{SB} - \underline{u}^{SB} \geq \frac{\psi}{\Delta\pi} > 0$
and thus $\bar{t}^{SB} > \underline{t}^{SB}$, implying that the right-hand side of (14.32) is strictly positive since
$u'' < 0$. Using that (14.24) and (14.25) are both binding, we can immediately obtain the
values of $u(\bar{t}^{SB})$ and $u(\underline{t}^{SB})$ by solving a system of two equations with two unknowns.

Note that the risk-averse agent does not receive full insurance anymore. Indeed, with
full insurance, the incentive compatibility constraint (14.3) can no longer be satisfied.
Inducing effort requires the agent to bear some risk. The following proposition provides a
summary.
Proposition 14.5.1 When the agent is strictly risk-averse, the optimal contract that 
induces effort makes both the agent's participation and incentive constraints binding. This 
contract does not provide full insurance. Moreover, second-best transfers are given by 

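$$\bar{t}^{SB} = h\left(\psi + (1-\pi_1)\frac{\psi}{\Delta\pi}\right) = h\left(\frac{(1-\pi_0)\psi}{\Delta\pi}\right) \tag{14.33}$$

and

$$\underline{t}^{SB} = h\left(\psi - \pi_1\frac{\psi}{\Delta\pi}\right) = h\left(-\frac{\pi_0\psi}{\Delta\pi}\right). \tag{14.34}$$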
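
As a numerical illustration of Proposition 14.5.1, the following sketch evaluates the two second-best transfers for one assumed specification. The CARA utility u(t) = 1 - exp(-t), its inverse h, and the parameter values are hypothetical choices made only for this example; they do not come from the text.

```python
import math

def u(t):
    """Assumed CARA utility with u(0) = 0 (an illustrative choice, not from the text)."""
    return 1.0 - math.exp(-t)

def h(v):
    """Inverse of u: the transfer that delivers utility level v (requires v < 1)."""
    return -math.log(1.0 - v)

def second_best_transfers(pi1, pi0, psi):
    """Transfers of Proposition 14.5.1, obtained when both constraints bind."""
    dpi = pi1 - pi0
    u_high = psi + (1.0 - pi1) * psi / dpi   # utility level after the high outcome
    u_low = psi - pi1 * psi / dpi            # utility level after the low outcome
    return h(u_high), h(u_low)

pi1, pi0, psi = 0.8, 0.4, 0.2   # hypothetical parameter values
t_high, t_low = second_best_transfers(pi1, pi0, psi)

# Both constraints should hold with equality at these transfers.
participation_slack = pi1 * u(t_high) + (1 - pi1) * u(t_low) - psi
incentive_slack = (pi1 - pi0) * (u(t_high) - u(t_low)) - psi
print(t_high, t_low, participation_slack, incentive_slack)
```

With these numbers the low-state transfer is negative and the two slacks vanish up to rounding, so the agent bears risk exactly as the proposition states.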
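
14.5.2 The Optimal Second-Best Effort

Let us now turn to the question of the second-best optimality of inducing a high effort,
from the principal's point of view. The second-best cost $C^{SB}$ of inducing effort under
moral hazard is the expected payment made to the agent, $C^{SB} = \pi_1\bar{t}^{SB} + (1-\pi_1)\underline{t}^{SB}$.
Using (14.33) and (14.34), this cost is rewritten as

$$\begin{aligned} C^{SB} &= \pi_1 h\left(\psi + (1-\pi_1)\frac{\psi}{\Delta\pi}\right) + (1-\pi_1) h\left(\psi - \pi_1\frac{\psi}{\Delta\pi}\right) \\ &= \pi_1 h\left(\frac{(1-\pi_0)\psi}{\Delta\pi}\right) + (1-\pi_1) h\left(-\frac{\pi_0\psi}{\Delta\pi}\right). \end{aligned} \tag{14.35}$$

The benefit of inducing effort is still $B = \Delta\pi\Delta S$, and a positive effort $e^* = 1$ is the
optimal choice of the principal whenever

$$\Delta\pi\Delta S \geq C^{SB} = \pi_1 h\left(\frac{(1-\pi_0)\psi}{\Delta\pi}\right) + (1-\pi_1) h\left(-\frac{\pi_0\psi}{\Delta\pi}\right). \tag{14.36}$$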

Figure 14.3: Second-best level of effort with moral hazard and risk aversion.

With $h(\cdot)$ being strictly convex, Jensen's inequality implies that the right-hand side
of (14.36) is strictly greater than the first-best cost of implementing effort $C^{FB} = h(\psi)$.
Therefore, inducing a higher effort occurs less often with moral hazard than when effort
is observable. The above figure represents this phenomenon graphically.

For $B$ belonging to the interval $[C^{FB}, C^{SB}]$, the second-best level of effort is zero and
is thus strictly below its first-best value. There is now an under-provision of effort because
of moral hazard and risk aversion.
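
The comparison between the first-best and second-best costs can be checked numerically. The sketch below again assumes the CARA utility u(t) = 1 - exp(-t), whose inverse h(v) = -ln(1 - v) is strictly convex, together with hypothetical parameter values; the function names are illustrative only.

```python
import math

def h(v):
    """Inverse of the assumed CARA utility u(t) = 1 - exp(-t); strictly convex."""
    return -math.log(1.0 - v)

def first_best_cost(psi):
    """C^FB = h(psi): full insurance with a binding participation constraint."""
    return h(psi)

def second_best_cost(pi1, pi0, psi):
    """C^SB as in (14.35): expected payment under the second-best transfers."""
    dpi = pi1 - pi0
    return (pi1 * h(psi + (1.0 - pi1) * psi / dpi)
            + (1.0 - pi1) * h(psi - pi1 * psi / dpi))

def induces_effort(benefit, cost):
    """The principal induces e = 1 whenever the benefit covers the cost."""
    return benefit >= cost

pi1, pi0, psi = 0.8, 0.4, 0.2          # hypothetical parameter values
c_fb = first_best_cost(psi)
c_sb = second_best_cost(pi1, pi0, psi)
benefit = 0.5 * (c_fb + c_sb)          # a benefit B strictly between the two costs
print(c_fb, c_sb, induces_effort(benefit, c_fb), induces_effort(benefit, c_sb))
```

For a benefit lying between the two costs, effort is induced when it is observable but not under moral hazard, which is the under-provision described above.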

Proposition 14.5.2 With moral hazard and risk aversion, there is a trade-off between
inducing effort and providing insurance to the agent. In a model with two possible levels
of effort, the principal induces a positive effort from the agent less often than when effort
is observable.



14.6 More than Two Levels of Performance 

We now extend our previous 2×2 model to allow for more than two levels of performance.
We consider a production process where $n$ possible outcomes can be realized. Those
performances can be ordered so that $q_1 < q_2 < \cdots < q_i < \cdots < q_n$. We denote the
principal's return in each of those states of nature by $S_i = S(q_i)$. In this context, a
contract is an $n$-tuple of payments $\{(t_1, \ldots, t_n)\}$. Also, let $\pi_{ik}$ be the probability that
production $q_i$ takes place when the effort level is $e_k$. We assume that $\pi_{ik} > 0$ for all pairs
$(i,k)$, with $\sum_{i=1}^{n} \pi_{ik} = 1$. Finally, we keep the assumption that only two levels of effort
are feasible, i.e., $e_k$ in $\{0, 1\}$. We still denote $\Delta\pi_i = \pi_{i1} - \pi_{i0}$.

14.6.1 Limited Liability 

Consider first the limited liability model. If the optimal contract induces a positive effort, 
it solves the following program: 

$$\max_{\{(t_1,\ldots,t_n)\}} \sum_{i=1}^{n} \pi_{i1}(S_i - t_i) \tag{14.37}$$

subject to

$$\sum_{i=1}^{n} \pi_{i1} t_i - \psi \geq 0, \tag{14.38}$$

$$\sum_{i=1}^{n} (\pi_{i1} - \pi_{i0}) t_i \geq \psi, \tag{14.39}$$

$$t_i \geq 0, \quad \text{for all } i \in \{1,\ldots,n\}. \tag{14.40}$$

(14.38) is the agent's participation constraint. (14.39) is his incentive constraint.
The constraints (14.40) are the limited liability constraints, which assume that the agent
cannot be given a negative payment.

First, note that the participation constraint (14.38) is implied by the incentive (14.39)
and the limited liability (14.40) constraints. Indeed, we have

$$\sum_{i=1}^{n} \pi_{i1} t_i - \psi = \underbrace{\sum_{i=1}^{n} (\pi_{i1} - \pi_{i0}) t_i - \psi}_{\geq 0} + \underbrace{\sum_{i=1}^{n} \pi_{i0} t_i}_{\geq 0} \geq 0.$$

Hence, we can neglect the participation constraint (14.38) in the optimization of the
principal's program.

Denoting the multiplier of (14.39) by $\lambda$ and the respective multipliers of (14.40) by
$\xi_i$, the first-order conditions lead to

$$-\pi_{i1} + \lambda\Delta\pi_i + \xi_i = 0, \tag{14.41}$$

with the slackness conditions $\xi_i t_i = 0$ for each $i$ in $\{1, \ldots, n\}$.

For $i$ such that the second-best transfer $t_i^{SB}$ is strictly positive, $\xi_i = 0$, and we must
have $\lambda = \frac{\pi_{i1}}{\pi_{i1}-\pi_{i0}}$ for any such $i$. If the ratios $\frac{\pi_{i1}-\pi_{i0}}{\pi_{i1}}$ are all different, there exists a single
index $j$ such that $\frac{\pi_{j1}-\pi_{j0}}{\pi_{j1}}$ is the highest possible ratio. The agent receives a strictly
positive transfer only in this particular state of nature $j$, and this payment is such that
the incentive constraint (14.39) is binding, i.e., $t_j^{SB} = \frac{\psi}{\pi_{j1}-\pi_{j0}}$. In all other states, the agent
receives no transfer and $t_i^{SB} = 0$ for all $i \neq j$. Finally, the agent gets a strictly positive
ex ante limited liability rent that is worth $EU^{SB} = \frac{\pi_{j0}\psi}{\pi_{j1}-\pi_{j0}}$.

The important point here is that the agent is rewarded in the state of nature that is the
most informative about the fact that he has exerted a positive effort. Indeed, $\frac{\pi_{i1}-\pi_{i0}}{\pi_{i1}}$ can
be interpreted as a likelihood ratio. The principal therefore uses a maximum likelihood
ratio criterion to reward the agent. The agent is only rewarded when this likelihood
ratio is maximized. Like an econometrician, the principal tries to infer from the observed
output what has been the parameter (effort) underlying this distribution. But here the
parameter is endogenously affected by the incentive contract.
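
The maximum likelihood ratio criterion can be sketched in a few lines. The function name and the probability vectors below are hypothetical; the sketch also assumes that the state with the highest ratio has a strictly higher probability under effort than without it, so that effort can be induced at all.

```python
def limited_liability_contract(pi1, pi0, psi):
    """Pay only in the state with the highest likelihood ratio.

    pi1[i] and pi0[i] are the probabilities of outcome i under effort 1 and 0.
    Returns the rewarded state j, the list of transfers, and the agent's rent.
    """
    ratios = [(a - b) / a for a, b in zip(pi1, pi0)]
    j = max(range(len(ratios)), key=ratios.__getitem__)
    transfers = [0.0] * len(pi1)
    transfers[j] = psi / (pi1[j] - pi0[j])     # incentive constraint (14.39) binds
    rent = pi0[j] * psi / (pi1[j] - pi0[j])    # ex ante limited liability rent
    return j, transfers, rent

pi1 = [0.1, 0.3, 0.6]   # hypothetical distribution of outcomes under e = 1
pi0 = [0.5, 0.3, 0.2]   # hypothetical distribution of outcomes under e = 0
psi = 1.0
j, t, rent = limited_liability_contract(pi1, pi0, psi)

# The rent should coincide with the agent's expected payment net of the effort cost.
expected_utility = sum(a * x for a, x in zip(pi1, t)) - psi
print(j, t, rent, expected_utility)
```

Here only the highest output is rewarded, and the agent's expected utility equals the rent formula stated above.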

Definition 14.6.1 The probabilities of success satisfy the monotone likelihood ratio property
(MLRP) if $\frac{\pi_{i1}-\pi_{i0}}{\pi_{i1}}$ is nondecreasing in $i$.

Proposition 14.6.1 If the probability of success satisfies MLRP, the second-best payment
$t_i^{SB}$ received by the agent may be chosen to be nondecreasing with the level of production
$q_i$.
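
Whether a given pair of distributions satisfies Definition 14.6.1 is a mechanical check. The helper below is a hypothetical sketch, and both example distributions are invented for illustration.

```python
def satisfies_mlrp(pi1, pi0):
    """Return True if (pi1[i] - pi0[i]) / pi1[i] is nondecreasing in i."""
    ratios = [(a - b) / a for a, b in zip(pi1, pi0)]
    return all(x <= y for x, y in zip(ratios, ratios[1:]))

# Hypothetical distributions over three ordered outcomes q_1 < q_2 < q_3.
monotone = satisfies_mlrp([0.1, 0.3, 0.6], [0.5, 0.3, 0.2])
non_monotone = satisfies_mlrp([0.2, 0.5, 0.3], [0.4, 0.2, 0.4])
print(monotone, non_monotone)
```

In the second example the middle outcome is the most informative about effort, so a payment schedule that rewards the most informative state would not be monotone in output.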

14.6.2 Risk Aversion

Suppose now that the agent is strictly risk-averse. The optimal contract that induces
effort must solve the program below:

$$\max_{\{(t_1,\ldots,t_n)\}} \sum_{i=1}^{n} \pi_{i1}(S_i - t_i) \tag{14.42}$$

subject to

$$\sum_{i=1}^{n} \pi_{i1} u(t_i) - \psi \geq \sum_{i=1}^{n} \pi_{i0} u(t_i) \tag{14.43}$$

and

$$\sum_{i=1}^{n} \pi_{i1} u(t_i) - \psi \geq 0, \tag{14.44}$$

where the latter constraint is the agent's participation constraint.

Using the same change of variables as before, it should be clear that the program is
again a concave problem with respect to the new variables $u_i = u(t_i)$. Using the same
notations as before, the first-order conditions of the principal's program are written as:

$$\frac{1}{u'(t_i^{SB})} = \mu + \lambda\left(\frac{\pi_{i1}-\pi_{i0}}{\pi_{i1}}\right) \quad \text{for all } i \in \{1, \ldots, n\}. \tag{14.45}$$

Multiplying each of these equations by $\pi_{i1}$ and summing over $i$ yields $\mu = E_q\left(\frac{1}{u'(t_i^{SB})}\right) > 0$,
where $E_q$ denotes the expectation operator with respect to the distribution of outputs
induced by effort $e = 1$.

Multiplying (14.45) by $\pi_{i1}u(t_i^{SB})$, summing all these equations over $i$, and taking into
account the expression of $\mu$ obtained above yields

$$\lambda\left(\sum_{i=1}^{n} (\pi_{i1} - \pi_{i0}) u(t_i^{SB})\right) = E_q\left(u(t_i^{SB})\left(\frac{1}{u'(t_i^{SB})} - E_q\left(\frac{1}{u'(t_i^{SB})}\right)\right)\right). \tag{14.46}$$

Using the slackness condition $\lambda\left(\sum_{i=1}^{n} (\pi_{i1} - \pi_{i0}) u(t_i^{SB}) - \psi\right) = 0$ to simplify the
left-hand side of (14.46), we finally get

$$\lambda\psi = \mathrm{cov}\left(u(t_i^{SB}), \frac{1}{u'(t_i^{SB})}\right). \tag{14.47}$$

By assumption, $u(\cdot)$ and $u'(\cdot)$ covary in opposite directions. Moreover, a constant wage
$t_i^{SB} = t^{SB}$ for all $i$ does not satisfy the incentive constraint, and thus $t_i^{SB}$ cannot be
constant everywhere. Hence, the right-hand side of (14.47) is necessarily strictly positive.
Thus we have $\lambda > 0$, and the incentive constraint is binding.

Coming back to (14.45), we observe that the left-hand side is increasing in $t_i^{SB}$ since
$u(\cdot)$ is concave. For $t_i^{SB}$ to be nondecreasing with $i$, MLRP must again hold. Then higher
outputs are also those that are the more informative ones about the realization of a high
effort. Hence, the agent should be more rewarded as output increases.
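
To see conditions (14.45) and (14.47) at work, the sketch below uses one assumed utility, u(t) = 2(sqrt(1 + t) - 1), chosen because 1/u'(t) = sqrt(1 + t) = 1 + u(t)/2 makes (14.45) linear in utility levels. With both constraints binding this gives mu = 1 + psi/2 and lambda = psi / (2 * sum_i (pi_i1 - pi_i0)^2 / pi_i1). The utility, the closed form, and the numbers are assumptions of this example, not results from the text.

```python
import math

def u(t):
    """Assumed utility u(t) = 2(sqrt(1 + t) - 1), defined for t >= -1, with u(0) = 0."""
    return 2.0 * (math.sqrt(1.0 + t) - 1.0)

def inverse_marginal_utility(t):
    """1/u'(t) = sqrt(1 + t) for the assumed utility."""
    return math.sqrt(1.0 + t)

def risk_averse_contract(pi1, pi0, psi):
    """Solve (14.45) with both constraints binding, for the assumed utility only."""
    ratios = [(a - b) / a for a, b in zip(pi1, pi0)]
    mu = 1.0 + psi / 2.0
    lam = psi / (2.0 * sum((a - b) * r for a, b, r in zip(pi1, pi0, ratios)))
    rhs = [mu + lam * r for r in ratios]       # right-hand side of (14.45)
    if min(rhs) <= 0.0:
        raise ValueError("parameters outside the range where this closed form applies")
    transfers = [x * x - 1.0 for x in rhs]     # invert sqrt(1 + t) = rhs
    return lam, mu, transfers

pi1 = [0.1, 0.3, 0.6]   # hypothetical distribution under e = 1 (satisfies MLRP)
pi0 = [0.5, 0.3, 0.2]   # hypothetical distribution under e = 0
psi = 0.2
lam, mu, t = risk_averse_contract(pi1, pi0, psi)

utilities = [u(x) for x in t]
mean_u = sum(a * v for a, v in zip(pi1, utilities))
mean_inv = sum(a * inverse_marginal_utility(x) for a, x in zip(pi1, t))
covariance = sum(a * (v - mean_u) * (inverse_marginal_utility(x) - mean_inv)
                 for a, v, x in zip(pi1, utilities, t))
incentive_gap = sum((a - b) * v for a, b, v in zip(pi1, pi0, utilities)) - psi
print(t, lam * psi, covariance, mean_u - psi, incentive_gap)
```

In this example the transfers increase with output, both constraints bind, and lambda times psi matches the covariance in (14.47) up to rounding.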

14.7 Contract Theory at Work 

This section elaborates on the moral hazard paradigm discussed so far in a number of 
settings that have been discussed extensively in the contracting literature. 

14.7.1 Efficiency Wage 

Let us consider a risk-neutral agent working for a firm, the principal. This is a basic
model studied by Shapiro and Stiglitz (AER, 1984). When the agent exerts effort $e$ in $\{0,1\}$, the
firm's added value is $\bar{V}$ (resp. $\underline{V}$) with probability $\pi(e)$ (resp. $1 - \pi(e)$). The agent can
only be rewarded for a good performance and cannot be punished for a bad outcome,
since he is protected by limited liability.

To induce effort, the principal must find an optimal compensation scheme $\{(\bar{t}, \underline{t})\}$ that
is the solution to the program below:

$$\max_{\{(\bar{t},\underline{t})\}} \pi_1(\bar{V} - \bar{t}) + (1-\pi_1)(\underline{V} - \underline{t}) \tag{14.48}$$

subject to

$$\pi_1\bar{t} + (1-\pi_1)\underline{t} - \psi \geq \pi_0\bar{t} + (1-\pi_0)\underline{t}, \tag{14.49}$$

$$\pi_1\bar{t} + (1-\pi_1)\underline{t} - \psi \geq 0, \tag{14.50}$$

$$\underline{t} \geq 0. \tag{14.51}$$

The problem is completely isomorphic to the one analyzed earlier. The limited liability
constraint is binding at the optimum, and the firm chooses to induce a high effort when
$\Delta\pi\Delta V \geq \frac{\pi_1\psi}{\Delta\pi}$. At the optimum, $\underline{t}^{SB} = 0$ and $\bar{t}^{SB} > 0$. The positive wage $\bar{t}^{SB} = \frac{\psi}{\Delta\pi}$ is
often called an efficiency wage because it induces the agent to exert a high (efficient) level
of effort. To induce production, the principal must give up a positive share of the firm's
profit to the agent.

14.7.2 Sharecropping

The moral hazard paradigm has been one of the leading tools used by development
economists to analyze agrarian economies. In the sharecropping model given in Stiglitz
(RES, 1974), the principal is now a landlord and the agent is the landlord's tenant. By
exerting an effort $e$ in $\{0, 1\}$, the tenant increases (decreases) the probability $\pi(e)$ (resp.
$1 - \pi(e)$) that a large quantity $\bar{q}$ (resp. small quantity $\underline{q}$) of an agricultural product is produced.
The price of this good is normalized to one so that the principal's stochastic return on
the activity is also $\bar{q}$ or $\underline{q}$, depending on the state of nature.

It is often the case that peasants in developing countries are subject to strong financial
constraints. To model such a setting we assume that the agent is risk neutral and protected
by limited liability. When he wants to induce effort, the principal's optimal contract must
solve

$$\max_{\{(\bar{t},\underline{t})\}} \pi_1(\bar{q} - \bar{t}) + (1-\pi_1)(\underline{q} - \underline{t}) \tag{14.52}$$

subject to

$$\pi_1\bar{t} + (1-\pi_1)\underline{t} - \psi \geq \pi_0\bar{t} + (1-\pi_0)\underline{t}, \tag{14.53}$$

$$\pi_1\bar{t} + (1-\pi_1)\underline{t} - \psi \geq 0, \tag{14.54}$$

$$\underline{t} \geq 0. \tag{14.55}$$

The optimal contract therefore satisfies $\underline{t}^{SB} = 0$ and $\bar{t}^{SB} = \frac{\psi}{\Delta\pi}$. This is again akin to
an efficiency wage. The expected utilities obtained respectively by the principal and the
agent are given by

$$EV^{SB} = \pi_1\bar{q} + (1-\pi_1)\underline{q} - \frac{\pi_1\psi}{\Delta\pi} \tag{14.56}$$

and

$$EU^{SB} = \frac{\pi_0\psi}{\Delta\pi}. \tag{14.57}$$

The flexible second-best contract described above has sometimes been criticized as
not corresponding to the contractual arrangements observed in most agrarian economies.
Contracts often take the form of simple linear schedules linking the tenant's production
to his compensation. As an exercise, let us now analyze a simple linear sharing rule
between the landlord and his tenant, with the landlord offering the agent a fixed share $\alpha$
of the realized production. Such a sharing rule automatically satisfies the agent's limited
liability constraint, which can therefore be omitted in what follows. Formally, the optimal
linear rule inducing effort must solve

$$\max_{\alpha} (1-\alpha)(\pi_1\bar{q} + (1-\pi_1)\underline{q}) \tag{14.58}$$

subject to

$$\alpha(\pi_1\bar{q} + (1-\pi_1)\underline{q}) - \psi \geq \alpha(\pi_0\bar{q} + (1-\pi_0)\underline{q}), \tag{14.59}$$

$$\alpha(\pi_1\bar{q} + (1-\pi_1)\underline{q}) - \psi \geq 0. \tag{14.60}$$

Obviously, only (14.59) is binding at the optimum. One finds the optimal linear sharing
rule to be

$$\alpha^{SB} = \frac{\psi}{\Delta\pi\Delta q}. \tag{14.61}$$

Note that $\alpha^{SB} < 1$ because, for the agricultural activity to be a valuable venture in
the first-best world, we must have $\Delta\pi\Delta q > \psi$. Hence, the return on the agricultural
activity is shared between the principal and the agent, with high-powered incentives ($\alpha$
close to one) being provided when the disutility of effort $\psi$ is large or when the principal's
gain from an increase of effort $\Delta\pi\Delta q$ is small.

This sharing rule also yields the following expected utilities to the principal and the
agent, respectively:

$$EV_{\alpha} = \pi_1\bar{q} + (1-\pi_1)\underline{q} - \left(\frac{\pi_1\bar{q} + (1-\pi_1)\underline{q}}{\Delta q}\right)\frac{\psi}{\Delta\pi} \tag{14.62}$$

and

$$EU_{\alpha} = \left(\frac{\pi_1\bar{q} + (1-\pi_1)\underline{q}}{\Delta q}\right)\frac{\psi}{\Delta\pi} - \psi. \tag{14.63}$$

Comparing (14.56) and (14.62) on the one hand and (14.57) and (14.63) on the other
hand, we observe that the constant sharing rule benefits the agent but not the principal.
A linear contract is less powerful than the optimal second-best contract. The former
contract is an inefficient way to extract rent from the agent even if it still provides sufficient
incentives to exert effort. Indeed, with a linear sharing rule, the agent always benefits from
a positive return on his production, even in the worst state of nature. This positive return
yields to the agent more than what is requested by the optimal second-best contract in
the worst state of nature, namely zero. Punishing the agent for a bad performance is thus
found to be rather difficult with a linear sharing rule.
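
The ranking of the two contracts can be verified on a small example. The sketch below computes the expected payoffs of the landlord and the tenant under the flexible second-best contract and under the linear sharing rule; the function names and the parameter values are hypothetical, and the tenant's payoff under the linear rule is taken to be his share of expected output net of the effort cost.

```python
def second_best_payoffs(pi1, pi0, psi, q_high, q_low):
    """Landlord's and tenant's expected payoffs under the flexible contract."""
    dpi = pi1 - pi0
    ev = pi1 * q_high + (1 - pi1) * q_low - pi1 * psi / dpi   # as in (14.56)
    eu = pi0 * psi / dpi                                      # as in (14.57)
    return ev, eu

def linear_rule_payoffs(pi1, pi0, psi, q_high, q_low):
    """Optimal share and expected payoffs under the linear sharing rule."""
    alpha = psi / ((pi1 - pi0) * (q_high - q_low))            # as in (14.61)
    expected_output = pi1 * q_high + (1 - pi1) * q_low
    ev = (1 - alpha) * expected_output                        # landlord keeps 1 - alpha
    eu = alpha * expected_output - psi                        # tenant's share net of effort cost
    return alpha, ev, eu

# Hypothetical parameter values.
pi1, pi0, psi, q_high, q_low = 0.8, 0.4, 1.0, 10.0, 4.0
ev_sb, eu_sb = second_best_payoffs(pi1, pi0, psi, q_high, q_low)
alpha, ev_lin, eu_lin = linear_rule_payoffs(pi1, pi0, psi, q_high, q_low)
print(alpha, ev_sb, ev_lin, eu_sb, eu_lin)
```

With these numbers the landlord earns less and the tenant earns more under the linear rule, in line with the comparison made above.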

A linear sharing rule allows the agent to keep some strictly positive rent $EU_{\alpha}$. If
the space of available contracts is extended to allow for fixed fees $\beta$, the principal can
nevertheless bring the agent down to the level of his outside opportunity by setting a fixed
fee equal to this rent $EU_{\alpha}$.


14.7.3 Wholesale Contracts 

Let us now consider a manufacturer-retailer relationship studied in Laffont and Tirole
(1993). The manufacturer supplies at constant marginal cost $c$ an intermediate good to
the risk-averse retailer, who sells this good on a final market. Demand on this market is
high (resp. low), $\bar{D}(p)$ (resp. $\underline{D}(p)$), with probability $\pi(e)$ (resp. $1 - \pi(e)$) where, again, $e$ is
in $\{0, 1\}$ and $p$ denotes the price for the final good. Effort $e$ is exerted by the retailer, who can
increase the probability that demand is high if after-sales services are efficiently performed. The
wholesale contract consists of a retail price maintenance agreement specifying the prices
$\bar{p}$ and $\underline{p}$ on the final market with a sharing of the profits, namely $\{(\bar{t},\bar{p}); (\underline{t},\underline{p})\}$. When
he wants to induce effort, the optimal contract offered by the manufacturer solves the
following problem:

$$\max_{\{(\bar{t},\bar{p});(\underline{t},\underline{p})\}} \pi_1\left((\bar{p} - c)\bar{D}(\bar{p}) - \bar{t}\right) + (1-\pi_1)\left((\underline{p} - c)\underline{D}(\underline{p}) - \underline{t}\right) \tag{14.64}$$

subject to (14.3) and (14.4).

The solution to this problem is obtained by appending the following expressions of the
retail prices to the transfers given in (14.33) and (14.34): $\bar{p}^* + \frac{\bar{D}(\bar{p}^*)}{\bar{D}'(\bar{p}^*)} = c$, and
$\underline{p}^* + \frac{\underline{D}(\underline{p}^*)}{\underline{D}'(\underline{p}^*)} = c$. Note that these prices are the same as those that would be chosen under complete
information. The pricing rule is not affected by the incentive problem.

14.7.4 Financial Contracts

Moral hazard is an important issue in financial markets. In Holmstrom and Tirole (AER,
1994), it is assumed that a risk-averse entrepreneur wants to start a project that requires
an initial investment worth an amount $I$. The entrepreneur has no cash of his own and
must raise money from a bank or any other financial intermediary. The return on the
project is random and equal to $\bar{V}$ (resp. $\underline{V}$) with probability $\pi(e)$ (resp. $1 - \pi(e)$), where
the effort exerted by the entrepreneur $e$ belongs to $\{0, 1\}$. We denote the spread of profits
by $\Delta V = \bar{V} - \underline{V} > 0$. The financial contract consists of repayments $\{(\bar{z},\underline{z})\}$, depending
upon whether the project is successful or not.

To induce effort from the borrower, the risk-neutral lender's program is written as

$$\max_{\{(\bar{z},\underline{z})\}} \pi_1\bar{z} + (1-\pi_1)\underline{z} - I \tag{14.65}$$

subject to

$$\pi_1 u(\bar{V} - \bar{z}) + (1-\pi_1)u(\underline{V} - \underline{z}) - \psi \geq \pi_0 u(\bar{V} - \bar{z}) + (1-\pi_0)u(\underline{V} - \underline{z}), \tag{14.66}$$

$$\pi_1 u(\bar{V} - \bar{z}) + (1-\pi_1)u(\underline{V} - \underline{z}) - \psi \geq 0. \tag{14.67}$$

Note that the project is a valuable venture if it provides the bank with a positive expected
profit.

With the change of variables $\bar{t} = \bar{V} - \bar{z}$ and $\underline{t} = \underline{V} - \underline{z}$, the principal's program takes
its usual form. This change of variables also highlights the fact that everything happens
as if the lender were benefitting directly from the returns of the project, and then paying
the agent only a fraction of the returns in the different states of nature.

Let us define the second-best cost of implementing a positive effort $C^{SB}$, and let us
assume that $\Delta\pi\Delta V \geq C^{SB}$, so that the lender wants to induce a positive effort level even
in a second-best environment. The lender's expected profit is worth

$$V_1 = \pi_1\bar{V} + (1-\pi_1)\underline{V} - C^{SB} - I. \tag{14.68}$$

Let us now parameterize projects according to the size of the investment $I$. Only the
projects with positive value $V_1 > 0$ will be financed. This requires the investment to be
low enough, and typically we must have

$$I < I^{SB} = \pi_1\bar{V} + (1-\pi_1)\underline{V} - C^{SB}. \tag{14.69}$$

Under complete information and no moral hazard, the project would instead be
financed as soon as

$$I < I^* = \pi_1\bar{V} + (1-\pi_1)\underline{V}. \tag{14.70}$$

For intermediary values of the investment, i.e., for $I$ in $[I^{SB}, I^*]$, moral hazard implies
that some projects are financed under complete information but no longer under moral
hazard. This is akin to some form of credit rationing.
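
The two financing thresholds can be illustrated numerically. The sketch below takes the threshold formulas as stated in (14.69) and (14.70) and evaluates the second-best cost for an assumed CARA utility u(t) = 1 - exp(-t), so that h(v) = -ln(1 - v); the utility, the function names, and all parameter values are hypothetical.

```python
import math

def h(v):
    """Inverse of the assumed CARA utility u(t) = 1 - exp(-t)."""
    return -math.log(1.0 - v)

def second_best_cost(pi1, pi0, psi):
    """Expected payment to the entrepreneur under the second-best transfers."""
    dpi = pi1 - pi0
    return (pi1 * h(psi + (1 - pi1) * psi / dpi)
            + (1 - pi1) * h(psi - pi1 * psi / dpi))

def financing_thresholds(pi1, pi0, psi, v_high, v_low):
    """Return (I_SB, I_star): the largest financeable investments in each regime."""
    expected_return = pi1 * v_high + (1 - pi1) * v_low
    return expected_return - second_best_cost(pi1, pi0, psi), expected_return

pi1, pi0, psi = 0.8, 0.4, 0.2      # hypothetical effort technology
v_high, v_low = 2.0, 0.5           # hypothetical project returns
i_sb, i_star = financing_thresholds(pi1, pi0, psi, v_high, v_low)

investment = 1.6                   # a project size between the two thresholds
financed_complete_info = investment < i_star
financed_moral_hazard = investment < i_sb
print(i_sb, i_star, financed_complete_info, financed_moral_hazard)
```

A project of this intermediate size is financed under complete information but rejected under moral hazard, which is the rationing effect described above.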

Finally, note that the optimal financial contract offered to the risk-averse and cashless
entrepreneur does not satisfy the limited liability constraint $\underline{t} \geq 0$. Indeed, we have
$\underline{t}^{SB} = h\left(\psi - \frac{\pi_1\psi}{\Delta\pi}\right) < 0$. To be induced to make an effort, the agent must bear some risk, which
implies a negative payoff in the bad state of nature. Adding the limited liability constraint,
the optimal contract would instead entail $\underline{t}^{LL} = 0$ and $\bar{t}^{LL} = h\left(\frac{\psi}{\Delta\pi}\right)$. Interestingly, this
contract has sometimes been interpreted in the corporate finance literature as a debt
contract, with no money being left to the borrower in the bad state of nature and the
residual being pocketed by the lender in the good state of nature.

Finally, note that

$$\bar{t}^{LL} - \underline{t}^{LL} = h\left(\frac{\psi}{\Delta\pi}\right) < \bar{t}^{SB} - \underline{t}^{SB} = h\left(\psi + (1-\pi_1)\frac{\psi}{\Delta\pi}\right) - h\left(\psi - \frac{\pi_1\psi}{\Delta\pi}\right), \tag{14.71}$$

since $h(\cdot)$ is strictly convex and $h(0) = 0$. This inequality shows that the debt contract
has less incentive power than the optimal incentive contract. Indeed, it becomes harder
to spread the agent's payments between both states of nature to induce effort if the agent
is protected by limited liability. For the agent, who is interested only in his payoff in the
high state of nature, only rewards are attractive.



14.8 A Continuum of Performances 

Let us now assume that the level of performance $q$ is drawn from a continuous distribution
with a cumulative function $F(\cdot|e)$ on the support $[\underline{q},\bar{q}]$. This distribution is conditional
on the agent's level of effort, which still takes two possible values $e$ in $\{0, 1\}$. We denote
by $f(\cdot|e)$ the density corresponding to the above distributions. A contract $t(q)$ inducing
a positive effort in this context must satisfy the incentive constraint

$$\int_{\underline{q}}^{\bar{q}} u(t(q))f(q|1)\,dq - \psi \geq \int_{\underline{q}}^{\bar{q}} u(t(q))f(q|0)\,dq, \tag{14.72}$$

and the participation constraint

$$\int_{\underline{q}}^{\bar{q}} u(t(q))f(q|1)\,dq - \psi \geq 0. \tag{14.73}$$

The risk-neutral principal's problem is thus written as

$$\max_{\{t(q)\}} \int_{\underline{q}}^{\bar{q}} (S(q) - t(q))f(q|1)\,dq, \tag{14.74}$$

subject to (14.72) and (14.73).

Denoting the multipliers of (14.72) and (14.73) by $\lambda$ and $\mu$, respectively, the Lagrangian
is written as

$$L(q, t) = (S(q) - t)f(q|1) + \lambda\left(u(t)(f(q|1) - f(q|0)) - \psi\right) + \mu\left(u(t)f(q|1) - \psi\right).$$

Optimizing pointwise with respect to $t$ yields

$$\frac{1}{u'(t^{SB}(q))} = \mu + \lambda\left(\frac{f(q|1) - f(q|0)}{f(q|1)}\right). \tag{14.75}$$

Multiplying (14.75) by $f(q|1)$ and taking expectations, we obtain, as in the discrete case,

$$\mu = E_q\left(\frac{1}{u'(t^{SB}(q))}\right) > 0, \tag{14.76}$$

where $E_q(\cdot)$ is the expectation operator with respect to the probability distribution of
output induced by an effort $e^{SB}$. Finally, using this expression of $\mu$, inserting it into
(14.75), and multiplying it by $f(q|1)u(t^{SB}(q))$, we obtain

$$\lambda(f(q|1) - f(q|0))u(t^{SB}(q)) = f(q|1)u(t^{SB}(q))\left(\frac{1}{u'(t^{SB}(q))} - E_q\left(\frac{1}{u'(t^{SB}(q))}\right)\right). \tag{14.77}$$

Integrating over $[\underline{q},\bar{q}]$ and taking into account the slackness condition
$\lambda\left(\int_{\underline{q}}^{\bar{q}} (f(q|1) - f(q|0))u(t^{SB}(q))\,dq - \psi\right) = 0$ yields
$\lambda\psi = \mathrm{cov}\left(u(t^{SB}(q)), \frac{1}{u'(t^{SB}(q))}\right) \geq 0$.

Hence, $\lambda \geq 0$ because $u(\cdot)$ and $u'(\cdot)$ vary in opposite directions. Also, $\lambda = 0$ only
if $t^{SB}(q)$ is a constant, but in this case the incentive constraint is necessarily violated.
As a result, we have $\lambda > 0$. Finally, $t^{SB}(q)$ is monotonically increasing in $q$ when the
monotone likelihood property $\frac{d}{dq}\left(\frac{f(q|1) - f(q|0)}{f(q|1)}\right) \geq 0$ is satisfied.
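
The pointwise condition (14.75) can be traced out numerically on a grid. The sketch below assumes densities f(q|1) = 0.5 + q and f(q|0) = 1 on [0, 1] and the utility u(t) = 2(sqrt(1 + t) - 1), for which 1/u'(t) = sqrt(1 + t); under these assumptions both binding constraints pin down mu = 1 + psi/2 and lambda through a single integral. Every functional form and number here is a hypothetical choice made for illustration.

```python
import math

def f1(q):
    """Assumed density of output under effort e = 1 on [0, 1]."""
    return 0.5 + q

def f0(q):
    """Assumed density of output under effort e = 0 on [0, 1] (uniform)."""
    return 1.0

def integrate(g, n=20000):
    """Midpoint rule on [0, 1]."""
    step = 1.0 / n
    return step * sum(g((k + 0.5) * step) for k in range(n))

def likelihood_ratio(q):
    return (f1(q) - f0(q)) / f1(q)

def u(t):
    """Assumed utility with 1/u'(t) = sqrt(1 + t)."""
    return 2.0 * (math.sqrt(1.0 + t) - 1.0)

psi = 0.1
mu = 1.0 + psi / 2.0
lam = psi / (2.0 * integrate(lambda q: (f1(q) - f0(q)) * likelihood_ratio(q)))

def transfer(q):
    """Pointwise solution of (14.75) for the assumed utility."""
    return (mu + lam * likelihood_ratio(q)) ** 2 - 1.0

participation_gap = integrate(lambda q: u(transfer(q)) * f1(q)) - psi
incentive_gap = integrate(lambda q: u(transfer(q)) * (f1(q) - f0(q))) - psi
schedule = [transfer(k / 100) for k in range(101)]
print(lam, participation_gap, incentive_gap, schedule[0], schedule[-1])
```

Because the assumed likelihood ratio is increasing in q, the computed payment schedule is increasing in output, and both constraints hold with equality up to numerical error.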

14.9 Further Extension 

We have stressed the various conflicts that may appear in a moral hazard environment.
The analysis of these conflicts, under both limited liability and risk aversion, was made
easy because of our focus on a simple 2×2 environment with a binary effort and two levels
of performance. The simple interaction between a single incentive constraint with either
a limited liability constraint or a participation constraint was quite straightforward.

When one moves away from the 2×2 model, the analysis becomes much harder, and
characterizing the optimal incentive contract is a difficult task. Examples of such complex
contracting environments abound. Effort may no longer be binary but, instead, may be
better characterized as a continuous variable. A manager may no longer choose between
working or not working on a project but may be able to fine-tune the exact effort spent
on this project. Even worse, the agent's actions may no longer be summarized by a
one-dimensional parameter but may be better described by a whole array of control variables
that are technologically linked. For instance, the manager of a firm may have to choose
how to allocate his effort between productive activities and monitoring his peers and other
workers.

Nevertheless, one can extend the standard model to the case where the agent can
perform more than two, and possibly a continuum of, levels of effort, to the case of
a multitask model, and to the case where the agent's utility function is no longer separable
between consumption and effort. One can also analyze the trade-off between efficiency
and redistribution in a moral hazard context. For a detailed discussion, see Chapter 5 of
Laffont and Martimort (2002).

Chapter 15

General Mechanism Design:
Contracts with Multi-Agents

15.1 Introduction 

In the previous chapters on the principal-agent theory, we introduced basic models
to explain the core of the principal-agent theory with complete contracts. These models
highlight the various trade-offs between allocative efficiency and the distribution of
information rents. Since the models involve only one agent, the design of the principal's
optimal contract reduces to a constrained optimization problem without having to appeal to
sophisticated game-theoretic concepts.

In this chapter, we will introduce some basic results and insights of mechanism
design in general, and of implementation theory in particular, for situations where there is
one principal (also called the designer) and several agents. In such a case, asymmetric
information may not only affect the relationship between the principal and each of his
agents, but it may also plague the relationships between agents. To describe the strategic
interaction between agents and the principal, game-theoretic reasoning is thus used
to model social institutions as varied as voting systems, auctions, bargaining protocols, and
methods for deciding on public projects.

Incentive problems arise when the social planner cannot distinguish between things
that are indeed different, so that a free-rider problem may appear. A free rider can improve
his welfare by not telling the truth about his own unobservable characteristic. As in the
principal-agent model, a basic insight of incentive mechanism design with more than one
agent is that incentive constraints should be considered coequally with resource constraints.
One of the most fundamental contributions of mechanism theory has been to show that
the free-rider problem may or may not occur, depending on the kind of game (mechanism)
that agents play and on the game-theoretic solution concept used. A theme that comes out of
the literature is the difficulty of finding mechanisms compatible with individual incentives
that simultaneously result in a desired social goal.

Examples of incentive mechanism design that take strategic interactions among agents
into account have existed for a long time. An early example is the Biblical story of the
judgement of Solomon, which determines who is the real mother of a baby. Two women came
before the King, disputing who was the mother of a child. The King's solution was to
threaten to cut the living baby in two and give half to each. One woman was willing
to give up the child, but the other woman agreed to cut it in two. The King then made his
judgement: the first woman is the mother; do not kill the child, and give him
to the first woman. Another example of incentive mechanism design is how to cut a
pie and divide it equally among all participants.

The first major development was the work of Gibbard, Hurwicz, and Satterthwaite in the
1970s. When information is private, the appropriate equilibrium concept is dominant
strategies. Incentives then take the form of incentive compatibility constraints, under which
telling the truth about his characteristics must be a dominant strategy for each agent. The
fundamental conclusion of the Gibbard-Hurwicz-Satterthwaite impossibility theorem is that
there is a trade-off between truth-telling and Pareto efficiency (or first-best outcomes
in general). Of course, if one is willing to give up Pareto efficiency, one can have a
truth-telling mechanism, such as the Vickrey-Clarke-Groves mechanism. In many cases, one can
ignore the first-best or Pareto efficiency, and so one can expect truth-telling behavior.

On the other hand, one could give up the truth-telling requirement and aim at
Pareto efficient outcomes. When the information about the characteristics of the agents
is shared by individuals but not by the designer, the relevant equilibrium concept
is Nash equilibrium. In this situation, one can give up truth-telling and use
a general message space. One may then design a mechanism that Nash implements Pareto
efficient allocations.

We will introduce these results and such trade-offs. We will also discuss the case of
incomplete information, in which agents do not know each other's characteristics and we
need to consider Bayesian incentive compatible mechanisms.

15.2 Basic Settings 

The theoretical framework of incentive mechanism design consists of five components: (1)
economic environments (fundamentals of the economy); (2) the social choice goal to be reached;
(3) an economic mechanism that specifies the rules of the game; (4) a solution concept
describing individuals' self-interested behavior; and (5) implementation of a social choice
goal (incentive compatibility of personal interests and the social goal at equilibrium).

15.2.1 Economic Environments 

$e_i = (Z_i, w_i, \succcurlyeq_i, Y_i)$: the economic characteristic of agent $i$, which consists of the
outcome space, the initial endowment if any, the preference relation, and the
production set if agent $i$ is also a producer;

$e = (e_1, \ldots, e_n)$: an economy;

$E$: the set of all a priori admissible economic environments.

$U = U_1 \times \cdots \times U_n$: the set of all admissible utility functions.

$\Theta = \Theta_1 \times \Theta_2 \times \cdots \times \Theta_n$: the set of all admissible parameters
$\theta = (\theta_1, \ldots, \theta_n) \in \Theta$ that determine types of parametric utility functions, and so it is
called the space of types or the state of the world.

Remark 15.2.1 Here, $E$ is a general expression of economic environments. However,
depending on the situations facing the designer, the set of admissible economic
environments under consideration sometimes may be just given by $E = U$, $E = \Theta$, or by the set
of all possible initial endowments, or production sets.

The designer is assumed not to know individuals' economic characteristics.
The individuals may or may not know the characteristics of the others. If they
know, it is called the complete information case; otherwise it is called the incomplete
information case.

For simplicity (but without loss of generality), when individuals' economic
characteristics are private information (the case of incomplete information), we assume preferences
are given by parametric utility functions. In this case, each agent $i$ privately observes
a type $\theta_i \in \Theta_i$, which determines his preferences over outcomes. The state $\theta$ is drawn
randomly from a prior distribution with density $\varphi(\cdot)$, which can also be a probability
mass function when $\Theta$ is finite. Each agent maximizes von Neumann-Morgenstern expected utility
over outcomes, given by the (Bernoulli) utility function $u_i(y, \theta_i)$. Thus, the information
structure is specified by

(1) $\theta_i$ is privately observed by agent $i$;

(2) $\{u_i(\cdot, \cdot)\}_{i=1}^{n}$ is common knowledge;

(3) $\varphi(\cdot)$ is common knowledge.

In this chapter, we will first discuss the case of complete information and then the
case of incomplete information.

15.2.2 Social Goal 

Given economic environments, each agent participates in economic activities, makes
decisions, receives benefits, and pays costs. The designer wants to reach
some desired goal that is considered to be socially optimal by some criterion.
Let

$Z = Z_1 \times \cdots \times Z_n$: the outcome space (for example, $Z = X \times Y$).
$A \subseteq Z$: the feasible set.

$F : E \rightrightarrows A$: the social goal, also called the social choice correspondence, in which
$F(e)$ is the set of socially desired outcomes at the economy $e$ under some
criterion of social optimality.

Examples of Social Choice Correspondences:

$P(e)$: the set of Pareto efficient allocations.
$I(e)$: the set of individually rational allocations.
$W(e)$: the set of Walrasian allocations.
$L(e)$: the set of Lindahl allocations.
$FA(e)$: the set of fair allocations.

When $F$ becomes a single-valued function, denoted by $f$, it is called a social choice
function.

Examples of Social Choice Functions:
Solomon's goal.
Majority voting rule.
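
To make the notion of a social choice correspondence concrete, the following sketch computes the Pareto efficient set on a small finite outcome space. The outcome space (integer splits of four units between two agents, with free disposal) and the utility functions are hypothetical choices for illustration.

```python
def pareto_efficient(outcomes, utilities):
    """Return the outcomes that no other outcome Pareto dominates."""
    def dominates(a, b):
        ua = [u(a) for u in utilities]
        ub = [u(b) for u in utilities]
        weakly_better = all(x >= y for x, y in zip(ua, ub))
        strictly_better = any(x > y for x, y in zip(ua, ub))
        return weakly_better and strictly_better
    return [a for a in outcomes if not any(dominates(b, a) for b in outcomes)]

# Hypothetical economy: allocations (x1, x2) with x1 + x2 <= 4, each agent
# caring only about his own consumption.
outcomes = [(x1, x2) for x1 in range(5) for x2 in range(5) if x1 + x2 <= 4]
utilities = [lambda z: z[0], lambda z: z[1]]
efficient = pareto_efficient(outcomes, utilities)
print(efficient)
```

The result is a set with several elements, namely all allocations that waste nothing, which is why the Pareto criterion defines a correspondence rather than a function.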

15.2.3 Economic Mechanism 

Since the designer lacks information about individuals' economic characteristics, he
needs to design an appropriate incentive mechanism (contract or rules of the game) to
coordinate personal interests and the social goal, i.e., under the mechanism, all individuals
have incentives to choose actions which result in socially optimal outcomes when they
pursue their personal interests. To do so, the designer announces how the information he
collects from individuals is used to determine outcomes; that is, he first states the rules
of the game. He then uses the information or actions of agents and the rules of the game to
determine the outcomes of individuals. Formally, let

$M_i$: the message space of agent $i$.

$M = M_1 \times \cdots \times M_n$: the message space in which communications take place.

$m_i \in M_i$: a message reported by agent $i$.

$m = (m_1, \ldots, m_n) \in M$: a profile of messages.

$h : M \rightarrow Z$: the outcome function that translates messages into outcomes.

$\Gamma = \langle M, h \rangle$: a mechanism.

That is, a mechanism consists of a message space and an outcome function.
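
The definition of a mechanism as a pair of a message space and an outcome function translates directly into a small data structure. The class below and the majority voting example are a hypothetical sketch; the names `Mechanism` and `majority_rule` are illustrative and do not come from the text.

```python
from dataclasses import dataclass
from itertools import product
from typing import Callable, Sequence, Tuple

@dataclass(frozen=True)
class Mechanism:
    """A game form <M, h>: one message space per agent plus an outcome function."""
    message_spaces: Tuple[Sequence[str], ...]            # M_1, ..., M_n
    outcome_function: Callable[[Tuple[str, ...]], str]   # h : M -> Z

    def profiles(self):
        """Enumerate the product message space M = M_1 x ... x M_n."""
        return product(*self.message_spaces)

    def outcome(self, messages):
        """Apply h to a message profile after checking that it lies in M."""
        if len(messages) != len(self.message_spaces):
            raise ValueError("one message per agent is required")
        for m_i, space in zip(messages, self.message_spaces):
            if m_i not in space:
                raise ValueError(f"message {m_i!r} is not in the agent's message space")
        return self.outcome_function(tuple(messages))

def majority_rule(messages):
    """Hypothetical outcome function: 'a' wins with a strict majority, otherwise 'b'."""
    votes_for_a = sum(1 for m in messages if m == "a")
    return "a" if 2 * votes_for_a > len(messages) else "b"

# Three agents, each reporting a vote for alternative 'a' or 'b'.
voting = Mechanism(message_spaces=(("a", "b"),) * 3, outcome_function=majority_rule)
print(voting.outcome(("a", "b", "a")))
print(len(list(voting.profiles())))
```

Note that the object specifies outcomes only; preferences over outcomes are not part of it, which is the sense in which a mechanism is a game form rather than a game.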

Remark 15.2.2 A mechanism is often also referred to as a game form. The terminology
of game form distinguishes it from a game in game theory in a number of ways. (1)
Mechanism design is normative analysis, in contrast to game theory, which is positive
economics. Game theory is important because it predicts how a given game will be played
by agents. Mechanism design goes one step further: given the physical environment and
the constraints faced by the designer, what goal can be realized or implemented? What
mechanisms are optimal among those that are feasible? (2) The consequence of a profile
of messages is an outcome in mechanism design rather than a vector of utility payoffs.
Of course, once the preferences of the individuals are specified, a game form or
mechanism induces a conventional game. (3) The preferences of individuals in the
mechanism design setting vary, while the preferences in a game are taken as given. This distinction
between mechanisms and games is critical. Because of this, an equilibrium (dominant
strategy equilibrium) exists much more easily in mechanism design than in a game. (4) In
designing mechanisms one must take into account incentive constraints in such a way that
personal interests are consistent with the goal that the designer wants to implement.

Remark 15.2.3 In the implementation (incentive mechanism design) literature, one requires a mechanism to be incentive compatible in the sense that personal interests are consistent with desired socially optimal outcomes even when individual agents are self-interested in their personal goals, without paying much attention to the size of the message space. In the realization literature originated by Hurwicz (1972, 1986b), a sub-field of the mechanism literature, one is also concerned with the size of the message space of a mechanism, and tries to find an economic system with a small operating cost. The smaller the message space of a mechanism, the lower the (transaction) cost of operating the mechanism. For the neoclassical economies, it has been shown that the competitive market economy system is the unique most efficient system that results in Pareto efficient and individually rational allocations (cf. Mount and Reiter (1974), Walker (1977), Osana (1978), Hurwicz (1986b), Jordan (1982), Tian (2004, 2005)).

15.2.4 Solution Concept of Self-interested Behavior 

A basic assumption in economics is that individuals are self-interested in the sense that they pursue their personal interests. Unless they can be made better off, they in general do not care about social interests. As a result, different economic environments and different rules of the game will lead to different reactions of individuals, and thus each individual agent's strategy will depend on his self-interested behavior, which in turn depends on the economic environment and the mechanism.

Let $b(e, \Gamma)$ be the set of equilibrium strategies that describes the self-interested behavior of individuals. Examples of such equilibrium solution concepts include Nash equilibrium, dominant strategy, Bayesian Nash equilibrium, etc.

Thus, given $E$, $M$, $h$, and $b$, the resulting equilibrium outcome is the composite function of the rules of the game and the equilibrium strategy, i.e., $h(b(e, \Gamma))$.

15.2.5 Implementation and Incentive Compatibility 

In what sense can we say that individuals' personal interests do not conflict with a social interest? We call this the implementation problem. The purpose of incentive mechanism design is to implement some desired socially optimal outcomes. Given a mechanism $\Gamma$ and an equilibrium behavior assumption $b(e, \Gamma)$, the implementation problem of a social choice rule $F$ studies the relationship between, and in particular the intersection of, $F(e)$ and $h(b(e, \Gamma))$, which can be illustrated by the following diagram.




Figure 15.1: Diagrammatic illustration of the mechanism design problem.

We have the following definitions of implementation and incentive compatibility of $F$.

A mechanism $\langle M, h \rangle$ is said to

(i) fully implement a social choice correspondence $F$ in equilibrium strategy $b(e, \Gamma)$ on $E$ if for every $e \in E$

(a) $b(e, \Gamma) \neq \emptyset$ (an equilibrium solution exists),

(b) $h(b(e, \Gamma)) = F(e)$ (personal interests are fully consistent with social goals);

(ii) implement a social choice correspondence $F$ in equilibrium strategy $b(e, \Gamma)$ on $E$ if for every $e \in E$

(a) $b(e, \Gamma) \neq \emptyset$,

(b) $h(b(e, \Gamma)) \subseteq F(e)$;

(iii) weakly implement a social choice correspondence $F$ in equilibrium strategy $b(e, \Gamma)$ on $E$ if for every $e \in E$

(a) $b(e, \Gamma) \neq \emptyset$,

(b) $h(b(e, \Gamma)) \cap F(e) \neq \emptyset$.

A mechanism $\langle M, h \rangle$ is said to be $b(e, \Gamma)$ incentive-compatible with a social choice correspondence $F$ in $b(e, \Gamma)$-equilibrium if it (fully or weakly) implements $F$ in $b(e, \Gamma)$-equilibrium.
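The three notions of implementation can be compared on a finite toy problem. The following is a minimal sketch, assuming finite sets of environments, messages, and outcomes; the function name `classify_implementation` and the toy data are hypothetical illustrations, not part of the original notes.

```python
def classify_implementation(environments, equilibria, outcome, social_choice):
    """Classify a mechanism as 'full', 'implement', 'weak', or 'none'.

    environments:  iterable of environments e in E
    equilibria:    e -> set of equilibrium message profiles, b(e, Gamma)
    outcome:       message profile -> outcome, the function h
    social_choice: e -> set of socially desired outcomes, F(e)
    """
    full = plain = weak = True
    for e in environments:
        eq = equilibria(e)
        if not eq:  # condition (a): an equilibrium must exist
            return "none"
        outcomes = {outcome(m) for m in eq}  # h(b(e, Gamma))
        target = social_choice(e)            # F(e)
        full = full and outcomes == target
        plain = plain and outcomes <= target
        weak = weak and bool(outcomes & target)
    if full:
        return "full"
    if plain:
        return "implement"
    if weak:
        return "weak"
    return "none"


# Hypothetical toy data: two environments, one-agent message profiles.
envs = ["e1", "e2"]
eq = {"e1": {("a",), ("b",)}, "e2": {("a",)}}
h = {("a",): "x", ("b",): "y"}
F = {"e1": {"x", "y"}, "e2": {"x", "z"}}

result = classify_implementation(envs, eq.__getitem__, h.__getitem__, F.__getitem__)
print(result)
```

In this toy case the equilibrium outcomes coincide with $F$ in `e1` but form a strict subset of $F$ in `e2`, so the mechanism implements $F$ without fully implementing it.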

Note that we have not yet specified a solution concept in defining implementability and incentive compatibility. As shown below, whether or not a social choice correspondence is implementable depends on the assumed solution concept of self-interested behavior. When information is complete, the solution concept can be dominant strategy equilibrium, Nash equilibrium, strong Nash equilibrium, subgame perfect Nash equilibrium, undominated equilibrium, etc. For incomplete information, the equilibrium strategy can be Bayesian Nash equilibrium, undominated Bayesian Nash equilibrium, etc.

15.3 Examples

Before we discuss some basic results in mechanism theory, we first present some economic environments which show that one needs to design a mechanism to solve incentive compatibility problems.

Example 15.3.1 (A Public Project) A society is deciding whether or not to build a public project at a cost $c$. The cost of the public project is shared by individuals. Let $s_i$ be the share of the cost borne by $i$, so that $\sum_{i \in N} s_i = 1$.

The outcome space is then $Y = \{0, 1\}$, where 0 represents not building the project and 1 represents building the project. Individual $i$'s value from use of this project is $r_i$. In this case, the net value of individual $i$ is 0 from not having the project built and $v_i = r_i - s_i c$ from having the project built. Thus agent $i$'s valuation function can be represented as

$$v_i(y, v_i) = y r_i - y s_i c = y v_i.$$

Example 15.3.2 (Continuous Public Goods Setting) In the above example, the public good could take only two values, and there is no scale problem. But in many cases, the level of public goods depends on the collection of contributions or taxes. Now let $y \in \mathbb{R}_+$ denote the scale of the public project and $c(y)$ denote the cost of producing $y$. Thus, the outcome space is $Z = \mathbb{R}_+ \times \mathbb{R}^n$, and the feasible set is $A = \{(y, z_1(y), \ldots, z_n(y)) \in \mathbb{R}_+ \times \mathbb{R}^n : \sum_{i \in N} z_i(y) = c(y)\}$, where $z_i(y)$ is the share of agent $i$ for producing the public goods $y$. The benefit of $i$ from building $y$ is $r_i(y)$ with $r_i(0) = 0$. Thus, the net benefit of not building the project is equal to 0, and the net benefit of building the project is $r_i(y) - z_i(y)$. The valuation function of agent $i$ can be written as

$$v_i(y) = r_i(y) - z_i(y).$$

Example 15.3.3 (Allocating an Indivisible Private Good) An indivisible good is to be allocated to one member of society. For instance, the rights to an exclusive license are to be allocated or an enterprise is to be privatized. In this case, the outcome space is $Z = \{y \in \{0, 1\}^n : \sum_{i=1}^n y_i = 1\}$, where $y_i = 1$ means individual $i$ obtains the object, and $y_i = 0$ means the individual does not get the object. If individual $i$ gets the object, the net value he derives from the object is $v_i$. If he does not get the object, his net value is 0. Thus, agent $i$'s valuation function is

$$v_i(y) = v_i y_i.$$

Note that we can regard $y$ as an $n$-dimensional vector of public goods since $v_i(y) = v_i y_i = v^i y$, where $v^i$ is a vector whose $i$-th component is $v_i$ and whose other components are zeros, i.e., $v^i = (0, \ldots, 0, v_i, 0, \ldots, 0)$.

From these examples, a socially optimal decision clearly depends on the individuals' true valuation functions $v_i(\cdot)$. For instance, we have shown previously that a public project is produced if and only if the total value to all individuals is greater than its total cost, i.e., if $\sum_{i \in N} r_i > c$, then $y = 1$, and if $\sum_{i \in N} r_i < c$, then $y = 0$.
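The decision rule for the public project in Example 15.3.1 can be sketched numerically. This is a minimal illustration with hypothetical numbers; the function names and the tie-breaking convention (build when total value exactly equals cost) are assumptions, since the text leaves the tie unspecified.

```python
def net_values(r, s, c):
    """Net value v_i = r_i - s_i * c of each agent when the project is built."""
    return [ri - si * c for ri, si in zip(r, s)]


def efficient_decision(r, s, c):
    """Return y = 1 if total willingness to pay covers the cost, else y = 0.

    Assumption: ties (sum of r_i equal to c) are resolved in favor of building.
    """
    return 1 if sum(r) >= c else 0


# Hypothetical data: three agents, cost shares summing to one.
r = [10, 30, 50]          # willingness to pay
s = [0.25, 0.25, 0.5]     # cost shares
c = 80                    # project cost

print(net_values(r, s, c))          # agent 1 loses from the project, agents 2 and 3 gain
print(efficient_decision(r, s, c))  # the project is built because 90 > 80
```

Since the shares sum to one, the sum of net values equals the total value minus the cost, so the rule can equivalently be stated in terms of $\sum_i v_i$.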

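Let $V_i$ be the set of all valuation functions $v_i$, let $V = \prod_{i \in N} V_i$, and let $h : V \to Z$ be a decision rule. Then $h$ is said to be efficient if and only if, for every $v \in V$,

$$\sum_{i \in N} v_i(h(v)) \ge \sum_{i \in N} v_i(h(v')) \quad \text{for all } v' \in V.$$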

15.4 Dominant Strategy and Truthful Revelation Mechanisms

The strongest solution concept for describing self-interested behavior is dominant strategy. The dominant strategy identifies situations in which the strategy chosen by each individual is the best, regardless of the choices of the others. The beauty of this equilibrium concept is in the weak rationality it demands of the agents: an agent need not forecast what the others are doing. An axiom in game theory is that agents will use a dominant strategy whenever one exists.

For $e \in E$, a mechanism $\Gamma = \langle M, h \rangle$ is said to have a dominant strategy equilibrium $m^*$ if for all $i$

$$h_i(m_i^*, m_{-i}) \succsim_i h_i(m_i, m_{-i}) \quad \text{for all } m \in M. \tag{15.1}$$

Denote by $D(e, \Gamma)$ the set of dominant strategy equilibria for $\Gamma = \langle M, h \rangle$ and $e \in E$.

Under the assumption of dominant strategy, since each agent's optimal choice does not depend on the choices of the others and does not require knowledge of the characteristics of the others, the information an individual needs to make decisions is minimal. Thus, if a dominant strategy equilibrium exists, it is an ideal situation. Another advantage is that bad equilibria are usually not a problem. If an agent has two dominant strategies they must be payoff-equivalent, which is not a generic property.

In a given game, dominant strategies are not likely to exist. However, since we are 
designing the game, we can try to ensure that agents do have dominant strategies. 

When the solution concept is given by dominant strategy equilibrium, i.e., $b(e, \Gamma) = D(e, \Gamma)$, a mechanism $\Gamma = \langle M, h \rangle$ implements a social choice correspondence $F$ in dominant strategy equilibrium on $E$ if for every $e \in E$,

(a) $D(e, \Gamma) \neq \emptyset$;

(b) $h(D(e, \Gamma)) \subseteq F(e)$.

The above definitions apply to general (indirect) mechanisms. There is, however, a particular class of game forms which have a natural appeal and have received much attention in the literature. These are called direct or revelation mechanisms, in which the message space $M_i$ for each agent $i$ is the set of possible characteristics $E_i$. In effect, each agent reports a possible characteristic but not necessarily his true one.

A mechanism $\Gamma = \langle M, h \rangle$ is said to be a revelation or direct mechanism if $M = E$.

Example 15.4.1 The optimal contracts we discussed in Chapter 13 are revelation mech- 
anisms. 

Example 15.4.2 The Groves mechanism we will discuss below is a revelation mechanism. 

The most appealing revelation mechanisms are those in which truthful reporting of 
characteristics always turns out to be an equilibrium. It is the absence of such a mechanism 
which has been called the "free-rider" problem in the theory of public goods. 

A revelation mechanism $\langle E, h \rangle$ is said to implement a social choice correspondence $F$ truthfully in $b(e, \Gamma)$ on $E$ if for every $e \in E$,

(a) $e \in b(e, \Gamma)$;

(b) $h(e) \subseteq F(e)$.

That is, $F(\cdot)$ is truthfully implementable in dominant strategies if truth-telling is a dominant strategy for each agent in the direct revelation mechanism.

Truthful implementability in dominant strategies is also called dominant strategy incentive compatibility, strategy-proofness, or straightforwardness.

Although the message space of a mechanism can be arbitrary, the following Revelation Principle tells us that one only needs to use the so-called revelation mechanism, in which the message space consists solely of the set of individuals' characteristics, and it is unnecessary to seek more complicated mechanisms. Thus, it significantly reduces the complexity of constructing a mechanism.

Theorem 15.4.1 (Revelation Principle) Suppose a mechanism $\langle M, h \rangle$ implements a social choice rule $F$ in dominant strategy. Then there is a revelation mechanism $\langle E, g \rangle$ which implements $F$ truthfully in dominant strategy.

Proof. Let $d$ be a selection of the dominant strategy correspondence of the mechanism $\langle M, h \rangle$, i.e., for every $e \in E$, $m^* = d(e) \in D(e, \Gamma)$. Since $\Gamma = \langle M, h \rangle$ implements the social choice rule $F$, such a selection exists by the implementation of $F$. Since the strategy of each agent is independent of the strategies of the others, each agent $i$'s dominant strategy can be expressed as $m_i^* = d_i(e_i)$.

Define the revelation mechanism $\langle E, g \rangle$ by $g(e) = h(d(e))$ for each $e \in E$. We first show that truth-telling is always a dominant strategy equilibrium of the revelation mechanism $\langle E, g \rangle$. Suppose not. Then, there exists a message $e'$ and an agent $i$ such that

$$u_i[g(e_i', e_{-i}')] > u_i[g(e_i, e_{-i}')].$$

However, since $g = h \circ d$, we have

$$u_i[h(d(e_i'), d(e_{-i}'))] > u_i[h(d(e_i), d(e_{-i}'))],$$

which contradicts the fact that $m_i^* = d_i(e_i)$ is a dominant strategy equilibrium. This is because, when the true economic environment is $(e_i, e_{-i}')$, agent $i$ has an incentive not to report $m_i^* = d_i(e_i)$ truthfully, but has an incentive to report $m_i' = d_i(e_i')$, a contradiction.

Finally, since $m^* = d(e) \in D(e, \Gamma)$ and $\langle M, h \rangle$ implements a social choice rule $F$ in dominant strategy, we have $g(e) = h(d(e)) = h(m^*) \in F(e)$. Hence, the revelation mechanism implements $F$ truthfully in dominant strategy. The proof is completed.
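The construction $g = h \circ d$ used in the proof can be illustrated on a small finite example. The sketch below is hypothetical: the indirect mechanism accepts the labels "yes" and "no", decodes them into bids, and applies a Groves-type rule (provide the good when bids sum to a non-negative number, pay each agent the sum of the others' bids). The type set, labels, and function names are assumptions made for illustration only.

```python
from itertools import product

TYPES = (-1, 2)                 # hypothetical net values an agent may have
LABEL = {-1: "no", 2: "yes"}    # d_i: type -> dominant message in the indirect mechanism
BID = {"no": -1, "yes": 2}      # how the indirect mechanism decodes a message


def h(messages):
    """Indirect mechanism: map a message profile to (y, transfers)."""
    bids = [BID[m] for m in messages]
    y = 1 if sum(bids) >= 0 else 0
    transfers = tuple((sum(bids) - b) if y else 0 for b in bids)
    return y, transfers


def d(profile):
    """Selection of dominant strategies, applied agent by agent."""
    return tuple(LABEL[e] for e in profile)


def g(profile):
    """Direct revelation mechanism g = h o d: agents report types directly."""
    return h(d(profile))


def utility(i, outcome, true_type):
    y, transfers = outcome
    return true_type * y + transfers[i]


def truth_is_dominant(n=2):
    """Brute-force check that no agent gains by misreporting in g."""
    for i in range(n):
        for true_type, report in product(TYPES, repeat=2):
            for others in product(TYPES, repeat=n - 1):
                honest = others[:i] + (true_type,) + others[i:]
                lie = others[:i] + (report,) + others[i:]
                if utility(i, g(lie), true_type) > utility(i, g(honest), true_type):
                    return False
    return True


print(truth_is_dominant())
```

The check enumerates every true type, every possible report, and every profile of the other agents' reports, which is exactly the quantification in the definition of a dominant strategy.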

Thus, by the Revelation Principle, we know that, if truthful implementation rather than implementation is all that we require, we need never consider general mechanisms. In the literature, if a revelation mechanism $\langle E, h \rangle$ truthfully implements a social choice rule $F$ in dominant strategy, the mechanism $\Gamma$ is sometimes said to be strongly individually incentive-compatible with a social choice correspondence $F$. In particular, when $F$ becomes a single-valued function $f$, $\langle E, f \rangle$ can be regarded as a revelation mechanism. Thus, if a mechanism $\langle M, h \rangle$ implements $f$ in dominant strategy, then the revelation mechanism $\langle E, f \rangle$ is incentive compatible in dominant strategy, also called strongly individually incentive compatible.

Remark 15.4.1 Notice that the Revelation Principle may be valid only for weak implementation. It does not apply to full implementation. The Revelation Principle specifies a correspondence between a dominant strategy equilibrium of the original mechanism $\langle M, h \rangle$ and the true profile of characteristics as a dominant strategy equilibrium, and it does not require the revelation mechanism to have a unique dominant strategy equilibrium, so the revelation mechanism $\langle E, g \rangle$ may also have non-truthful equilibria that do not correspond to any equilibrium of the original mechanism. Thus, in moving from general (indirect) dominant strategy mechanisms to direct ones, one may introduce undesirable dominant strategies which are not truthful. More troubling, these additional strategies may create a situation where the indirect mechanism is an implementation of a given $F$, while the direct revelation mechanism is not. Thus, even if a mechanism implements a social choice function, the corresponding revelation mechanism $\langle E, g \rangle$ may only weakly implement, but not implement, $F$.

15.5 Gibbard-Satterthwaite Impossibility Theorem 

The Revelation Principle is very useful for finding a dominant strategy mechanism. If one hopes that a social choice goal $f$ can be (weakly) implemented in dominant strategy, one only needs to show that the revelation mechanism $\langle E, f \rangle$ is strongly incentive compatible.

However, the more is known a priori about agents' characteristics, the fewer incentive-compatibility constraints an implementable choice function has to satisfy, and the more likely it is to be implementable. Thus, the worst possible case is when nothing is known about the agents' preferences over $X$. The Gibbard-Satterthwaite impossibility theorem in Chapter 9 tells us that, if the domain of economic environments is unrestricted, such a mechanism does not exist unless it is a dictatorial mechanism. We restate this theorem here from the angle of mechanism design.

Definition 15.5.1 A social choice function is dictatorial if there exists an agent whose optimal choice is the social optimum.

Now we state the Gibbard-Satterthwaite Theorem without proof, since the proof is very complicated. A proof can be found, for example, in Salanié (2000), Microeconomics of Market Failures.

Theorem 15.5.1 (Gibbard-Satterthwaite Theorem) If $X$ has at least 3 alternatives, a social choice function which is strongly individually incentive compatible and defined on an unrestricted domain is dictatorial.

15.6 Hurwicz Impossibility Theorem 

The Gibbard-Satterthwaite impossibility theorem is a very negative result. This result is essentially equivalent to Arrow's impossibility result. However, as we will show, when the admissible set of economic environments is restricted, the result may be positive, as with the Groves mechanism defined on quasi-linear utility functions. Unfortunately, the following Hurwicz impossibility theorem shows that Pareto efficiency and truthful revelation are fundamentally inconsistent even for the class of neoclassical economic environments.

Theorem 15.6.1 (Hurwicz Impossibility Theorem, 1972) For the neoclassical private goods economies, there is no mechanism $\langle M, h \rangle$ that implements Pareto efficient and individually rational allocations in dominant strategy. Consequently, any revelation mechanism $\langle M, h \rangle$ that yields Pareto efficient and individually rational allocations is not strongly individually incentive compatible. (Truth-telling about their preferences is not a Nash equilibrium.)

Proof: By the Revelation Principle, we only need to show that no revelation mechanism can implement Pareto efficient and individually rational allocations truthfully in dominant strategy equilibrium for a particular pure exchange economy. In turn, it is enough to show that truth-telling is not a Nash equilibrium for any revelation mechanism that yields Pareto efficient and individually rational allocations for a particular pure exchange economy.

Consider a private goods economy with two agents ($n = 2$) and two goods ($L = 2$),

$$w_1 = (0, 2), \qquad w_2 = (2, 0),$$

$$u_i(x, y) = \begin{cases} 3x_i + y_i & \text{if } x_i \le y_i \\ x_i + 3y_i & \text{if } x_i > y_i. \end{cases}$$

Figure 15.2: An illustration of the proof of Hurwicz's impossibility theorem.

Thus, feasible allocations are given by

$$A = \{[(x_1, y_1), (x_2, y_2)] \in \mathbb{R}^4_+ : x_1 + x_2 = 2, \; y_1 + y_2 = 2\}.$$

Let $U_i$ be the set of all neoclassical utility functions, i.e., those that are continuous and quasi-concave, which agent $i$ can report to the designer. Thus, the true utility function $\mathring{u}_i \in U_i$. Then,

$$U = U_1 \times U_2, \qquad h : U \to A.$$

Note that, if the true utility function profile $\mathring{u}$ were a Nash equilibrium, it would satisfy

$$\mathring{u}_i(h_i(\mathring{u}_i, \mathring{u}_{-i})) \ge \mathring{u}_i(h_i(u_i, \mathring{u}_{-i})). \tag{15.2}$$

We want to show that $\mathring{u}$ is not a Nash equilibrium. Note that,

(1) $P(e) = O_1 O_2$ (contract curve)

(2) $IR(e) \cap P(e) = \overline{ab}$

(3) $h(\mathring{u}_1, \mathring{u}_2) = d \in \overline{ab}$

Now, suppose agent 2 reports his utility function by cheating:

$$u_2(x_2, y_2) = 2x_2 + y_2. \tag{15.3}$$

Then, with $u_2$, the new set of individually rational and Pareto efficient allocations is given by

$$IR(e) \cap P(e) = \overline{ae}. \tag{15.4}$$

Note that any point between $a$ and $e$ is strictly preferred to $d$ by agent 2. Thus, an allocation determined by any mechanism which is individually rational and Pareto efficient under $(\mathring{u}_1, u_2)$ is some point, say, the point $c$ in the figure, on the segment of the line determined by $a$ and $e$. Hence, we have

$$\mathring{u}_2(h_2(\mathring{u}_1, u_2)) > \mathring{u}_2(h_2(\mathring{u}_1, \mathring{u}_2)) \tag{15.5}$$

since $h_2(\mathring{u}_1, u_2) = c \in \overline{ae}$. Similarly, if $d$ lies on $\overline{ae}$, then agent 1 has an incentive to cheat. Thus, no mechanism that yields Pareto efficient and individually rational allocations is incentive compatible. The proof is completed.

Thus, Hurwicz's impossibility theorem implies that Pareto efficiency and truthful revelation of individuals' characteristics are fundamentally incompatible. However, if one is willing to give up Pareto efficiency, say, one only requires the efficient provision of public goods, is it possible to find an incentive compatible mechanism which results in the Pareto efficient provision of a public good and can truthfully reveal individuals' characteristics? The answer is positive. For the class of quasi-linear utility functions, the so-called Vickrey-Clarke-Groves Mechanism can be such a mechanism.

15.7 Vickrey-Clarke-Groves Mechanisms

From Chapter 12 on public goods, we know that public goods economies may present problems for a decentralized resource allocation mechanism because of the free-rider problem. Private provision of a public good generally results in less than an efficient amount of the public good. Voting may result in too much or too little of a public good. Are there any mechanisms that result in the "right" amount of the public good? This is a question of incentive compatible mechanism design.

Again, we will focus on the quasi-linear environment, where all agents are known to care for money. In this environment, the results are more positive, and we can even implement the efficient decision rule (the efficient provision of public goods). For simplicity, let us first return to the model of a discrete public good.

15.7.1 Vickrey-Clarke-Groves Mechanisms for Discrete Public Good

Consider a provision problem of a discrete public good. Suppose that the economy has $n$ agents. Let

$c$: the cost of producing the public project.
$r_i$: the maximum willingness to pay of $i$.
$s_i$: the share of the cost by $i$.
$v_i = r_i - s_i c$: the net value of $i$.

The public project is determined according to

$$y = \begin{cases} 1 & \text{if } \sum_{i=1}^n v_i \ge 0 \\ 0 & \text{otherwise.} \end{cases}$$

From the discussion in Chapter 12, it is efficient to produce the public good, $y = 1$, if and only if

$$\sum_{i=1}^n v_i = \sum_{i=1}^n (r_i - s_i c) \ge 0.$$

Since the maximum willingness to pay of each agent, $r_i$, is private information, and so is the net value $v_i$, what mechanism should one use to determine whether the project is built? One mechanism that we might use is simply to ask each agent to report his or her net value and provide the public good if and only if the sum of the reported values is positive. The problem with such a scheme is that it does not provide the right incentives for individual agents to reveal their true willingness-to-pay. Individuals may have incentives to underreport their willingness-to-pay.

Thus, a question is how we can induce each agent to truthfully reveal his true value for the public good. The so-called Vickrey-Clarke-Groves (VCG) mechanism gives such a mechanism.

Suppose the utility functions are quasi-linear in the net increment in private good, $x_i - w_i$, and have the form:

$$u_i(x_i - w_i, y) = x_i - w_i + r_i y \quad \text{s.t.} \quad x_i + g_i y = w_i + t_i,$$

where $g_i = s_i c$ is agent $i$'s share of the cost and $t_i$ is the transfer to agent $i$. Then, we have

$$u_i(t_i, y) = t_i + r_i y - g_i y = t_i + (r_i - g_i) y = t_i + v_i y.$$

• Groves Mechanism:

In a Groves mechanism, agents are required to report their net values. Thus the message space of each agent $i$ is $M_i = \mathbb{R}$. The Groves mechanism is defined as follows:

$$\Gamma = (M_1, \ldots, M_n, t_1(\cdot), t_2(\cdot), \ldots, t_n(\cdot), y(\cdot)) \equiv (M, t(\cdot), y(\cdot)),$$

where

(1) $b_i \in M_i = \mathbb{R}$: each agent $i$ reports a "bid" for the public good, i.e., reports a net value, which may or may not be his true net value $v_i$.

(2) The level of the public good is determined by

$$y(b) = \begin{cases} 1 & \text{if } \sum_{i=1}^n b_i \ge 0 \\ 0 & \text{otherwise.} \end{cases}$$

(3) Each agent $i$ receives a side payment (transfer)

$$t_i = \begin{cases} \sum_{j \ne i} b_j & \text{if } \sum_{i=1}^n b_i \ge 0 \\ 0 & \text{otherwise.} \end{cases} \tag{15.6}$$

Then, the payoff of agent $i$ is given by

$$\phi_i(b) = \begin{cases} v_i + t_i = v_i + \sum_{j \ne i} b_j & \text{if } \sum_{i=1}^n b_i \ge 0 \\ 0 & \text{otherwise.} \end{cases} \tag{15.7}$$

We want to show that it is optimal for each agent to report the true net value, $b_i = v_i$, regardless of what the other agents report. That is, truth-telling is a dominant strategy equilibrium.

Proposition 15.7.1 Truth-telling is a dominant strategy under the Groves-Clarke mechanism.

Proof: There are two cases to be considered.

Case 1: $v_i + \sum_{j \ne i} b_j > 0$. Then agent $i$ can ensure the public good is provided by reporting $b_i = v_i$. Indeed, if $b_i = v_i$, then $\sum_{j \ne i} b_j + v_i = \sum_{j=1}^n b_j > 0$ and thus $y = 1$. In this case, $\phi_i(v_i, b_{-i}) = v_i + \sum_{j \ne i} b_j > 0$.

Case 2: $v_i + \sum_{j \ne i} b_j \le 0$. Agent $i$ can ensure that the public good is not provided by reporting $b_i = v_i$ so that $\sum_{j=1}^n b_j \le 0$. In this case, $\phi_i(v_i, b_{-i}) = 0 \ge v_i + \sum_{j \ne i} b_j$.

Thus, in either case, agent $i$ has an incentive to tell the true value of $v_i$. Hence, it is optimal for agent $i$ to tell the truth. There is no incentive for agent $i$ to misrepresent his true net value regardless of what other agents do.
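The dominance argument can be checked by brute force on a small grid of bids. The sketch below implements the decision rule and transfer (15.6) and the payoff (15.7); the grid of values, the number of agents, and the function names are hypothetical choices for illustration.

```python
from itertools import product


def groves(bids):
    """Groves mechanism for a discrete public good: returns (y, transfers)."""
    total = sum(bids)
    y = 1 if total >= 0 else 0
    # Each agent receives the sum of the others' bids when the good is provided.
    transfers = [total - b if y else 0 for b in bids]
    return y, transfers


def payoff(i, true_value, bids):
    """Payoff of agent i with true net value true_value under the bid profile."""
    y, transfers = groves(bids)
    return true_value * y + transfers[i]


def truth_is_dominant(grid, n=3):
    """Check that no misreport on the grid beats truth-telling for any agent."""
    for i in range(n):
        for v in grid:
            for others in product(grid, repeat=n - 1):
                honest = list(others[:i]) + [v] + list(others[i:])
                honest_payoff = payoff(i, v, honest)
                for lie in grid:
                    deviation = list(others[:i]) + [lie] + list(others[i:])
                    if payoff(i, v, deviation) > honest_payoff:
                        return False
    return True


GRID = [-3, -1, 0, 2, 4]   # hypothetical set of possible net values and bids
print(groves([2, -1, 0]))
print(truth_is_dominant(GRID))
```

A finite grid check is of course not a proof, but it mirrors the two cases in the argument above: a misreport matters only when it flips the decision, and flipping the decision never raises the agent's payoff.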

The above preference revelation mechanism has a major fault: the total side-payment may be very large. Thus, it is very costly to induce the agents to tell the truth.

Ideally, we would like to have a mechanism where the sum of the side-payments is equal to zero so that the feasibility condition holds, and consequently it results in Pareto efficient allocations, but in general this is impossible by Hurwicz's impossibility theorem. However, we could modify the above mechanism by asking each agent to pay a "tax" rather than receive a payment. Because this tax is wasted, the allocation of public goods is still not Pareto efficient.

The basic idea of paying a tax is to add an extra amount to agent $i$'s side-payment, $d_i(b_{-i})$, that depends only on what the other agents do.

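A General Groves Mechanism: Ask each agent to pay an additional tax, $d_i(b_{-i})$. In this case, the transfer is given by

$$t_i(b) = \begin{cases} \sum_{j \ne i} b_j - d_i(b_{-i}) & \text{if } \sum_{i=1}^n b_i \ge 0 \\ -d_i(b_{-i}) & \text{if } \sum_{i=1}^n b_i < 0. \end{cases} \tag{15.8}$$

The payoff to agent $i$ now takes the form:

$$\phi_i(b) = \begin{cases} v_i + \sum_{j \ne i} b_j - d_i(b_{-i}) & \text{if } \sum_{i=1}^n b_i \ge 0 \\ -d_i(b_{-i}) & \text{otherwise.} \end{cases}$$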

For exactly the same reason as for the mechanism above, one can prove that it is 
optimal for each agent i to report his true net value. 

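If the function $d_i(b_{-i})$ is suitably chosen, the size of the side-payment can be significantly reduced. A nice choice was given by Clarke (1971). He suggested a particular Groves mechanism known as the Clarke mechanism (also called the Pivotal mechanism):

The Pivotal Mechanism is a special case of the general Groves Mechanism in which $d_i(b_{-i})$ is given by

$$d_i(b_{-i}) = \begin{cases} \sum_{j \ne i} b_j & \text{if } \sum_{j \ne i} b_j \ge 0 \\ 0 & \text{if } \sum_{j \ne i} b_j < 0. \end{cases}$$

In this case, it gives

$$t_i(b) = \begin{cases} 0 & \text{if } \sum_{i=1}^n b_i \ge 0 \text{ and } \sum_{j \ne i} b_j \ge 0 \\ \sum_{j \ne i} b_j & \text{if } \sum_{i=1}^n b_i \ge 0 \text{ and } \sum_{j \ne i} b_j < 0 \\ -\sum_{j \ne i} b_j & \text{if } \sum_{i=1}^n b_i < 0 \text{ and } \sum_{j \ne i} b_j \ge 0 \\ 0 & \text{if } \sum_{i=1}^n b_i < 0 \text{ and } \sum_{j \ne i} b_j < 0, \end{cases} \tag{15.9}$$

i.e.,

$$t_i(b) = \begin{cases} -\left|\sum_{j \ne i} b_j\right| & \text{if } \left(\sum_{i=1}^n b_i\right)\left(\sum_{j \ne i} b_j\right) < 0 \\ -\left|\sum_{j \ne i} b_j\right| & \text{if } \sum_{i=1}^n b_i = 0 \text{ and } \sum_{j \ne i} b_j < 0 \\ 0 & \text{otherwise.} \end{cases} \tag{15.10}$$

Therefore, the payoff of agent $i$ is

$$\phi_i(b) = \begin{cases} v_i & \text{if } \sum_{i=1}^n b_i \ge 0 \text{ and } \sum_{j \ne i} b_j \ge 0 \\ v_i + \sum_{j \ne i} b_j & \text{if } \sum_{i=1}^n b_i \ge 0 \text{ and } \sum_{j \ne i} b_j < 0 \\ -\sum_{j \ne i} b_j & \text{if } \sum_{i=1}^n b_i < 0 \text{ and } \sum_{j \ne i} b_j \ge 0 \\ 0 & \text{if } \sum_{i=1}^n b_i < 0 \text{ and } \sum_{j \ne i} b_j < 0. \end{cases} \tag{15.11}$$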

Remark 15.7.1 Thus, from the transfer given in (15.10), adding in the side-payment has the effect of taxing agent $i$ only if he changes the social decision. Such an agent is called the pivotal person. The amount of the tax agent $i$ must pay is the amount by which agent $i$'s bid damages the other agents. The price that agent $i$ must pay to change the amount of public good is equal to the harm that he imposes on the other agents.
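The pivotal tax can be computed directly from the Groves transfer and the Clarke choice of $d_i(b_{-i})$. The following is a small sketch with hypothetical bids; the function names are illustrative assumptions.

```python
def pivotal_transfer(i, bids):
    """Clarke (pivotal) transfer to agent i: Groves transfer minus d_i(b_{-i})."""
    total = sum(bids)
    others = total - bids[i]                  # sum of the other agents' bids
    groves_part = others if total >= 0 else 0
    d_i = others if others >= 0 else 0        # Clarke's choice of d_i(b_{-i})
    return groves_part - d_i


def pivotal_outcome(bids):
    """Return the decision y and the list of transfers under the pivotal mechanism."""
    y = 1 if sum(bids) >= 0 else 0
    return y, [pivotal_transfer(i, bids) for i in range(len(bids))]


# Agent 0 is pivotal in both hypothetical profiles below.
print(pivotal_outcome([5, -2, -2]))   # project built only because of agent 0
print(pivotal_outcome([-5, 2, 2]))    # project blocked only because of agent 0
```

In both profiles only the agent who changes the decision pays, and the payment equals the absolute value of the others' total bid, which is the harm imposed on them.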
15.7.2 The Vickrey-Clarke-Groves Mechanisms with Continuous Public Goods

Now we are concerned with the provision of continuous public goods. Consider a public goods economy with $n$ agents, one private good, and $K$ public goods. Denote

$x_i$: the consumption of the private good by $i$;
$y$: the consumption of the public goods by all individuals;
$t_i$: transfer payment to $i$;
$g_i(y)$: the contribution made by $i$;
$c(y)$: the cost function of producing public goods $y$ that satisfies the condition:

$$\sum_{i=1}^n g_i(y) = c(y).$$

Then, agent $i$'s budget constraint should satisfy

$$x_i + g_i(y) = w_i + t_i \tag{15.12}$$

and his utility functions are given by

$$u_i(x_i - w_i, y) = x_i - w_i + u_i(y). \tag{15.13}$$

By substitution,

$$u_i(t_i, y) = t_i + (u_i(y) - g_i(y)) \equiv t_i + v_i(y),$$

where $v_i(y)$ is called the valuation function of agent $i$. From the budget constraint,

$$\sum_{i=1}^n \{x_i + g_i(y)\} = \sum_{i=1}^n w_i + \sum_{i=1}^n t_i, \tag{15.14}$$

we have

$$\sum_{i=1}^n x_i + c(y) = \sum_{i=1}^n w_i + \sum_{i=1}^n t_i. \tag{15.15}$$

The feasibility (or balanced) condition then becomes

$$\sum_{i=1}^n t_i = 0. \tag{15.16}$$
Recall that Pareto efficient allocations are completely characterized by

$$\max \sum_{i=1}^n a_i u_i(x_i, y) \quad \text{s.t.} \quad \sum_{i=1}^n x_i + c(y) = \sum_{i=1}^n w_i.$$

For quasi-linear utility functions it is easily seen that the weights $a_i$ must be the same for all agents, since $a_i = \lambda$, the Lagrangian multiplier, for interior Pareto efficient allocations (no income effect), and $y$ is thus uniquely determined for the special case of quasi-linear utility functions $u_i(t_i, y) = t_i + v_i(y)$. Then, the above characterization problem becomes

$$\max_{t_i, y} \left[ \sum_{i=1}^n (t_i + v_i(y)) \right] \tag{15.17}$$

or equivalently

(1) $\max_y \sum_{i=1}^n v_i(y)$;

(2) (feasibility condition): $\sum_{i=1}^n t_i = 0$.

Then, the Lindahl-Samuelson condition is given by:

$$\sum_{i=1}^n \frac{\partial v_i(y)}{\partial y_k} = 0,$$

that is,

$$\sum_{i=1}^n \frac{\partial u_i(y)}{\partial y_k} = \frac{\partial c(y)}{\partial y_k}.$$

Thus, Pareto efficient allocations for quasi-linear utility functions are completely characterized by the Lindahl-Samuelson condition $\sum_{i=1}^n v_i'(y) = 0$ and the feasibility condition $\sum_{i=1}^n t_i = 0$.

In a Groves mechanism, it is supposed that each agent is asked to report the valuation function $v_i(y)$. Denote his reported valuation function by $b_i(y)$.

To get the efficient level of public goods, the government may announce that it will provide a level of public goods $y^*$ that solves

$$\max_y \sum_{i=1}^n b_i(y).$$
The Groves mechanism has the form:

$$\Gamma = (V, h) \tag{15.18}$$

where $V = V_1 \times \cdots \times V_n$ is the message space that consists of the set of all possible valuation functions with element $b_i(y) \in V_i$, and $h = (t_1(b), t_2(b), \ldots, t_n(b), y(b))$ are outcome functions. It is determined by:

(1) Ask each agent $i$ to report his/her valuation function $b_i(y)$, which may or may not be the true valuation $v_i(y)$.

(2) Determine $y^*$: the level of the public goods, $y^* = y(b)$, is determined by

$$\max_y \sum_{i=1}^n b_i(y). \tag{15.19}$$

(3) Determine $t$: the transfer of agent $i$, $t_i$, is determined by

$$t_i = \sum_{j \ne i} b_j(y^*). \tag{15.20}$$

The payoff of agent $i$ is then given by

$$\phi_i(b(y^*)) = v_i(y^*) + t_i(b) = v_i(y^*) + \sum_{j \ne i} b_j(y^*). \tag{15.21}$$

The social planner's goal is to have the optimal level $y^*$ that solves the problem:

$$\max_y \sum_{i=1}^n b_i(y).$$

In which case is the individual's interest consistent with the social planner's interest? Under the rule of this mechanism, it is optimal for each agent $i$ to truthfully report his true valuation function $b_i(y) = v_i(y)$ since agent $i$ wants to maximize

$$v_i(y) + \sum_{j \ne i} b_j(y).$$

By reporting $b_i(y) = v_i(y)$, agent $i$ ensures that the government will choose $y^*$, which also maximizes his payoff while the government maximizes the social welfare. That is, the individual's interest is consistent with the social interest that is determined by the Lindahl-Samuelson condition. Thus, truth-telling, $b_i(y) = v_i(y)$, is a dominant strategy equilibrium.
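The alignment between the agent's objective and the planner's objective can be illustrated on a grid. The sketch below assumes, purely for illustration, a one-dimensional public good, quadratic valuations $v_i(y) = \theta_i y - y^2/2$, and reports restricted to the same parametric family; none of these assumptions come from the notes.

```python
Y_GRID = [0.5 * k for k in range(21)]   # hypothetical feasible levels 0, 0.5, ..., 10


def valuation(theta):
    """Hypothetical valuation v(y) = theta * y - y^2 / 2."""
    return lambda y: theta * y - 0.5 * y * y


def choose_level(reports):
    """Planner's rule (15.19): pick y maximizing the sum of reported valuations."""
    return max(Y_GRID, key=lambda y: sum(b(y) for b in reports))


def groves_payoff(i, true_theta, reported_thetas):
    """Payoff (15.21): true valuation at y* plus the others' reported valuations."""
    reports = [valuation(t) for t in reported_thetas]
    y_star = choose_level(reports)
    transfer = sum(b(y_star) for j, b in enumerate(reports) if j != i)
    return valuation(true_theta)(y_star) + transfer


true_thetas = [1, 2, 3]
honest = groves_payoff(0, 1, true_thetas)
# Compare against every misreport of agent 0 on a hypothetical candidate set.
deviations = [groves_payoff(0, 1, [t, 2, 3]) for t in range(-2, 8)]
print(honest, max(deviations))
```

When agent 0 reports truthfully, the planner maximizes exactly the function that agent 0 cares about, so no misreport in the candidate set can yield a strictly higher payoff.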
In general, $\sum_{i=1}^n t_i(b(y)) \neq 0$, which means that a Groves mechanism in general does not result in Pareto efficient outcomes even if it satisfies the Lindahl-Samuelson condition, i.e., even if the public goods are provided at the Pareto efficient level.

As in the discrete case, the total transfer can be very large. Just as before, the transfers can be reduced by an appropriate side-payment. The Groves mechanism can be modified to

$$t_i(b) = \sum_{j \ne i} b_j(y) - d_i(b_{-i}).$$

The general form of the Groves Mechanism is then $\Gamma = \langle V, t, y(b) \rangle$ such that

(1) $\sum_{i=1}^n b_i(y(b)) \ge \sum_{i=1}^n b_i(y)$ for $y \in Y$;

(2) $t_i(b) = \sum_{j \ne i} b_j(y(b)) - d_i(b_{-i})$.

A special case of the Groves mechanism was independently described by Clarke and is called the Clarke mechanism (also called the pivotal mechanism), in which $d_i(b_{-i})$ is given by

$$d_i(b_{-i}) = \max_y \sum_{j \ne i} b_j(y). \tag{15.22}$$

That is, the pivotal mechanism, $\Gamma = \langle V, t, y(b) \rangle$, is to choose $(y^*, t_i^*)$ such that

(1) $\sum_{i=1}^n b_i(y^*) \ge \sum_{i=1}^n b_i(y)$ for $y \in Y$;

(2) $t_i(b) = \sum_{j \ne i} b_j(y^*) - \max_y \sum_{j \ne i} b_j(y)$.

It is interesting to point out that the Clarke mechanism contains the well-known Vickrey 
auction mechanism (the second-price auction mechanism) as a special case. Under the 
Vickrey mechanism, the highest bidder obtains the object, and he pays the second 
highest bid. To see this, let us explore this relationship in the case of a single 
good auction (Example 9.3.3 in the beginning of this chapter). In this case, the outcome 
space is 

$$Z = \Big\{y \in \{0,1\}^n : \sum_{i=1}^{n} y_i = 1\Big\},$$

where $y_i = 1$ implies that agent $i$ gets the object, and $y_i = 0$ means the person does not 
get the object. Agent $i$'s valuation function is then given by 

$$v_i(y) = v_i y_i.$$

Since we can regard $y$ as an $n$-dimensional vector of public goods, by the Clarke mechanism 
above, we know that 

$$y = g(b) = \Big\{y \in Z : \max \sum_{i=1}^{n} b_i y_i\Big\} = \{y \in Z : \max_{i \in N} b_i\},$$

that is, the object goes to an agent with the highest bid, and truth-telling is a dominant 
strategy. Thus, if $g_i(v) = 1$, then $t_i(v) = \sum_{j \neq i} v_j y_j^* - \max_{y} \sum_{j \neq i} v_j y_j = -\max_{j \neq i} v_j$. 
If $g_i(v) = 0$, then $t_i(v) = 0$. This means that the object 
is allocated to the individual with the highest valuation and he pays an amount equal to 
the second highest valuation. No other payments are made. This is exactly the outcome 
predicted by the Vickrey mechanism. 
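
The reduction of the pivotal mechanism to a second-price auction can be checked numerically. The following is a minimal sketch, not part of the original notes: the function name `pivotal_auction` and the sample bids are illustrative assumptions, and ties are broken in favor of the lowest index purely for convenience.

```python
def pivotal_auction(bids):
    """Pivotal mechanism for allocating one indivisible object.

    bids[i] is agent i's reported valuation b_i, so the reported value of an
    outcome y is b_i * y_i. Returns (winner, transfers), where transfers[i]
    is t_i(b), the transfer received by agent i (negative means a payment).
    """
    n = len(bids)
    # Outcome space Z: exactly one agent receives the object.
    outcomes = [tuple(1 if k == i else 0 for k in range(n)) for i in range(n)]

    def others_value(i, y):
        # Sum of the reported values of all agents except i at outcome y.
        return sum(bids[j] * y[j] for j in range(n) if j != i)

    # Efficient decision: maximize the total reported value.
    y_star = max(outcomes, key=lambda y: sum(b * yi for b, yi in zip(bids, y)))
    winner = y_star.index(1)
    # t_i(b) = sum_{j != i} b_j(y*) - max_y sum_{j != i} b_j(y)
    transfers = [
        others_value(i, y_star) - max(others_value(i, y) for y in outcomes)
        for i in range(n)
    ]
    return winner, transfers


winner, transfers = pivotal_auction([3, 7, 5])
print(winner, transfers)  # agent 1 wins and pays the second-highest bid, 5
```

Only the winner has a nonzero transfer, and its size is the second-highest bid, which is the Vickrey outcome described above.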

Do there exist other mechanisms implementing the efficient decision rule $y^*$? The 
answer is no if the valuation functions $v(y, \cdot)$ are sufficiently "rich" in a sense. To see this, 
consider the class of parametric valuation functions $v_i(y, \cdot)$ with a continuum type space 
$\Theta_i = [\underline{\theta}, \bar{\theta}]$. We first state the following lemma. 

Lemma 15.7.1 At a solution $(y(\cdot), t(\cdot))$, the individual rationality constraints $IR_\theta$ are not binding for 
all $\theta > \underline{\theta}$, and $IR_{\underline{\theta}}$ is binding at $\underline{\theta}$. 

Proof. As in the two-type case, the single-crossing property (SCP), $IR_{\underline{\theta}}$ and $IC_\theta$ imply 
$IR_\theta > 0$. Now, if $IR_{\underline{\theta}}$ were not binding at $\underline{\theta}$, we could increase $t(\theta)$ by $\varepsilon > 0$ for all 
$\theta \in [\underline{\theta}, \bar{\theta}]$, which would preserve all incentive constraints and increase the Principal's 
profit, a contradiction. 

Proposition 15.7.2 (Holmstrom (1979)) Suppose that for each $i$, $\Theta_i \subset \mathbb{R}^d$ is smoothly 
connected, $v_i(y, \theta_i)$ is differentiable in $\theta_i$ and the gradient $D_{\theta_i} v_i(y, \theta_i)$ is uniformly 
bounded across $(y, \theta_i)$. Then any mechanism implementing an efficient decision rule $y^*(\cdot)$ in 
dominant strategy is a Vickrey-Clarke-Groves mechanism. 

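Proof. Let $U_i(\theta) = v_i(y^*(\theta), \theta_i) + t_i(\theta)$ denote agent $i$'s equilibrium utility in the 
mechanism. Fix some $\underline{\theta}_i \in \Theta_i$, and consider a smooth path $\gamma$ from $\underline{\theta}_i$ to $\theta_i$. Applying 
the Envelope Theorem of Milgrom and Segal (2000, Theorem 2) along path $\gamma$, agent $i$'s 
dominant-strategy incentive-compatibility constraints imply (as in Lemma 15.7.1): 

$$U_i(\theta) = U_i(\underline{\theta}_i, \theta_{-i}) + \int_0^1 D_{\theta_i} v_i(y^*(\gamma(\tau), \theta_{-i}), \gamma(\tau)) \, d\gamma(\tau).$$

Let 

$$S(\theta) = \sum_{i=1}^{n} v_i(y^*(\theta), \theta_i) = \max_{y \in Y} \sum_{i=1}^{n} v_i(y, \theta_i).$$

Again, applying the Envelope Theorem of Milgrom and Segal (2000, Theorem 2) to 
this program along path $\gamma$, we have 

$$S(\theta) = S(\underline{\theta}_i, \theta_{-i}) + \int_0^1 D_{\theta_i} v_i(y^*(\gamma(\tau), \theta_{-i}), \gamma(\tau)) \, d\gamma(\tau).$$

Subtracting, we get 

$$U_i(\theta) - S(\theta) = U_i(\underline{\theta}_i, \theta_{-i}) - S(\underline{\theta}_i, \theta_{-i}) \equiv d_i(\theta_{-i}),$$

and thus 

$$t_i(\theta) = U_i(\theta) - v_i(y^*(\theta), \theta_i) = S(\theta) + d_i(\theta_{-i}) - v_i(y^*(\theta), \theta_i) = \sum_{j \neq i} v_j(y^*(\theta), \theta_j) + d_i(\theta_{-i}),$$

which is the transfer for a Vickrey-Clarke-Groves mechanism. 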

The intuition behind the proof is that each agent $i$ only has the incentive to tell the 
truth when he is made the residual claimant of total surplus, i.e., $U_i(\theta) = S(\theta) + d_i(\theta_{-i})$, 
which requires a VCG mechanism. This result was also obtained by Laffont and Maskin 
(Econometrica, 1980), but under much more restrictive assumptions.$^1$ 

15.7.3 Balanced VCG Mechanisms 

Implementation of a surplus-maximizing decision rule $y^*(\cdot)$ is a necessary condition for 
(ex post) Pareto efficiency, but it is not sufficient. To have Pareto efficiency, we must also 
guarantee that there is an ex post balanced budget (no waste of numeraire): 

$$\sum_{i=1}^{n} t_i(\theta) = 0, \quad \forall \theta \in \Theta. \tag{15.23}$$

A trivial case for which we can achieve ex post efficiency is given in the following example. 

Example 15.7.1 If there exists some agent $i$ such that $\Theta_i = \{\theta_i\}$ is a singleton, then we 
have no incentive problem for agent $i$, and we can set 

$$t_i(\theta) = -\sum_{j \neq i} t_j(\theta), \quad \forall \theta \in \Theta.$$

This trivially guarantees a balanced budget. 

1 A particular case of the proposition was also obtained by Green and Laffont (1979). Namely, they 
allow agents to have all possible valuations over a finite decision set $X$, which can be described by the 
Euclidean type space $\Theta_i = \mathbb{R}^{|X|}$ (the valuation of agent $i$ for decision $k$ being represented by $\theta_{ik}$). 

However, in general, there is no such positive result. When the set of $v(\cdot, \cdot)$ functions 
is "rich", there may be no choice rule $f(\cdot) = (y^*(\cdot), t_1(\cdot), \ldots, t_I(\cdot))$, where $y^*(\cdot)$ is ex 
post optimal, that is truthfully implementable in dominant strategies and that satisfies (15.23), so 
a Pareto efficient outcome cannot be guaranteed. 

To have an example where budget balance cannot be achieved, consider a setting with 
two agents. Under the assumptions of Proposition 15.7.2, any mechanism implementing 
an efficient decision rule $y^*(\cdot)$ in dominant strategy is a VCG mechanism, hence 

$$t_i(\theta) = v_{-i}(y^*(\theta), \theta_{-i}) + d_i(\theta_{-i}).$$

Budget balance requires that 

$$0 = t_1(\theta) + t_2(\theta) = v_1(y^*(\theta), \theta_1) + v_2(y^*(\theta), \theta_2) + d_1(\theta_2) + d_2(\theta_1).$$

Letting $S(\theta) = v_1(y^*(\theta), \theta_1) + v_2(y^*(\theta), \theta_2)$ denote the maximum surplus in state $\theta$, we 
must therefore have 

$$S(\theta) = -d_1(\theta_2) - d_2(\theta_1).$$

Thus, efficiency can only be achieved when maximal total surplus is additively separable 
in the agents' types, which is unlikely. 

To have a robust example where additive separability does not hold, consider the 
"public good setting" in which $Y = [\underline{y}, \bar{y}] \subset \mathbb{R}$ with $\underline{y} < \bar{y}$, and for each agent $i$, 
$\Theta_i = [\underline{\theta}_i, \bar{\theta}_i] \subset \mathbb{R}$, $\underline{\theta}_i < \bar{\theta}_i$ (to rule out the situation described in the above Example), 
$v_i(y, \theta_i)$ is differentiable in $\theta_i$, $\partial v_i(y, \theta_i)/\partial \theta_i$ is bounded, $v_i(y, \theta_i)$ has the single-crossing property 
(SCP) in $(y, \theta_i)$ and $y^*(\theta_1, \theta_2)$ is in the interior of $Y$. 

Note that by the Envelope Theorem, we have (almost everywhere) 

$$\frac{\partial S(\theta)}{\partial \theta_1} = \frac{\partial v_1(y^*(\theta_1, \theta_2), \theta_1)}{\partial \theta_1}.$$

It is clear that if $y^*(\theta_1, \theta_2)$ depends on $\theta_2$ and $\partial v_1(y, \theta_1)/\partial \theta_1$ depends on $y$, in general $\partial S(\theta)/\partial \theta_1$ 
will depend on $\theta_2$, and therefore $S(\theta)$ will not be additively separable. In fact, 

• SCP of $v_2(\cdot)$ implies that $y^*(\theta_1, \theta_2)$ is strictly increasing in $\theta_2$. 

• SCP of $v_1(\cdot)$ implies that $\partial v_1(y, \theta_1)/\partial \theta_1$ is strictly increasing in $y$. 

Consequently, $\partial S(\theta)/\partial \theta_1$ is strictly increasing in $\theta_2$, thus $S(\theta)$ is not additively separable. 
In this case, no ex post efficient choice rule is truthfully implementable in dominant 
strategies. 
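
A quick numerical check of the failure of additive separability, as a sketch under an assumed functional form: take two agents with $v_i(y, \theta_i) = \theta_i y - y^2/2$ (which has the single-crossing property) and suppose $Y$ is large enough for $y^*$ to be interior. The function names below are illustrative. Any additively separable $S(\theta_1, \theta_2) = g(\theta_1) + h(\theta_2)$ has a zero cross difference, so a nonzero value rules separability out.

```python
from fractions import Fraction


def efficient_level(theta1, theta2):
    # First-order condition of max_y (theta1 + theta2) * y - y**2.
    return Fraction(theta1 + theta2, 2)


def max_surplus(theta1, theta2):
    # S(theta) = v_1(y*, theta_1) + v_2(y*, theta_2) for the assumed quadratic form.
    y = efficient_level(theta1, theta2)
    v1 = theta1 * y - y * y / 2
    v2 = theta2 * y - y * y / 2
    return v1 + v2


def cross_difference(S, a, b, c, d):
    # Equals zero whenever S(theta1, theta2) = g(theta1) + h(theta2).
    return S(a, c) - S(a, d) - S(b, c) + S(b, d)


print(cross_difference(max_surplus, 1, 0, 1, 0))
```

For this specification $S(\theta) = (\theta_1 + \theta_2)^2/4$, whose cross term $\theta_1 \theta_2 / 2$ is what makes the cross difference nonzero.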

If the wasted money is subtracted from the social surplus, it is no longer socially 
efficient to implement the decision rule $y^*(\cdot)$. Instead, we may be interested, e.g., in 
maximizing the expectation 

$$E\Big[\sum_{i} v_i(y(\theta), \theta_i) + \sum_{i} t_i(\theta)\Big]$$

in the class of choice rules $(y(\cdot), t_1(\cdot), \ldots, t_I(\cdot))$ satisfying dominant incentive compatibility, 
given a probability distribution $\varphi$ over states $\theta$. In this case, we may have interim 
efficiency as we will see in the section on Bayesian implementation. 

Nevertheless, when economic environments are "thin", we can get some positive results. 
Groves and Loeb (1975), Tian (1996a), and Liu and Tian (1999) provide such a class 
of economic environments. An example of such utility functions is: $u_i(t_i, y) = t_i + v_i(y)$ 
with $v_i(y) = -\frac{1}{2} y^2 + \theta_i y$ for $i = 1, 2, \ldots, n$ with $n \geq 3$. 
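
To see why quadratic valuations with at least three agents admit a balanced Groves mechanism, note that the maximal surplus is $S(\theta) = (\sum_i \theta_i)^2 / (2n)$, a sum of terms each involving at most two agents, so every term can be charged to agents whose own type it does not contain. The sketch below uses one possible choice of $d_i(\theta_{-i})$ worked out for this illustration; it is an assumption of the example, not a formula quoted from the cited papers.

```python
from fractions import Fraction
from itertools import combinations


def v(y, theta):
    # Quadratic valuation v_i(y) = theta_i * y - y**2 / 2.
    return theta * y - y * y / 2


def efficient_level(thetas):
    # Maximizer of sum_i v_i(y): y* = (sum of types) / n.
    return Fraction(sum(thetas), len(thetas))


def d(i, thetas):
    # Side payment depending only on the other agents' reports (needs n >= 3).
    n = len(thetas)
    others = [t for k, t in enumerate(thetas) if k != i]
    squares = sum(t * t for t in others)
    cross = sum(a * b for a, b in combinations(others, 2))
    return -Fraction(n - 1, 2 * n) * (
        Fraction(squares, n - 1) + Fraction(2 * cross, n - 2)
    )


def transfers(thetas):
    # Groves transfer: others' valuations at y* plus the side payment d_i.
    y = efficient_level(thetas)
    n = len(thetas)
    return [
        sum(v(y, thetas[j]) for j in range(n) if j != i) + d(i, thetas)
        for i in range(n)
    ]


def utility(i, true_theta, reports):
    # Payoff t_i + v_i(y) of agent i with true type true_theta given all reports.
    return v(efficient_level(reports), true_theta) + transfers(reports)[i]


print(sum(transfers([1, 2, 4])))
```

Because each $d_i$ ignores agent $i$'s own report, the mechanism remains a Groves mechanism, so truth-telling stays a dominant strategy while the transfers sum to zero.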



15.8 Nash Implementation 

15.8.1 Nash Equilibrium and General Mechanism Design 

From Hurwicz's impossibility theorem, we know that, if one wants to have a mechanism 
that results in Pareto efficient and individually rational allocations, one must give up the 
dominant strategy implementation, and then, by Revelation Principle, we must look at 
more general mechanisms < M, h > instead of using a revelation mechanism. 

We know the dominant strategy equilibrium is a very strong solution concept. Now, if 
we adopt the Nash equilibrium as a solution concept to describe individuals' self-interested 
behavior, can we design an incentive compatible mechanism which implements Pareto 
efficient allocations? 

For $e \in E$, a mechanism $\langle M, h \rangle$ is said to have a Nash equilibrium $m^* \in M$ if 

$$h_i(m^*) \succsim_i h_i(m_i, m^*_{-i}) \tag{15.24}$$

for all $m_i \in M_i$ and all $i$. Denote by $NE(e, \Gamma)$ the set of all Nash equilibria of the 
mechanism $\Gamma$ for $e \in E$. 

It is clear that every dominant strategy equilibrium is a Nash equilibrium, but the converse 
may not be true. 

A mechanism $\Gamma = \langle M, h \rangle$ is said to Nash-implement a social choice correspondence 
$F$ on $E$ if for every $e \in E$ 

(a) $NE(e, \Gamma) \neq \emptyset$; 

(b) $h(NE(e, \Gamma)) \subseteq F(e)$. 

It fully Nash-implements a social choice correspondence $F$ on $E$ if for every $e \in E$ 

(a) $NE(e, \Gamma) \neq \emptyset$; 

(b) $h(NE(e, \Gamma)) = F(e)$. 

The following proposition shows that, if truth-telling about their characteristics is a 
Nash equilibrium of the revelation mechanism, it must be a dominant strategy equilibrium 
of the mechanism. 

Proposition 15.8.1 For a revelation mechanism $\Gamma = \langle E, h \rangle$, truth-telling $e^*$ is a 
Nash equilibrium if and only if it is a dominant strategy equilibrium: 

$$h(e_i^*, e_{-i}) \succsim_i h(e_i, e_{-i}) \quad \forall (e_i, e_{-i}) \in E \text{ and } i \in N. \tag{15.25}$$

Proof. Since for every $e \in E$ and $i$, by Nash equilibrium, we have 

$$h(e_i, e_{-i}) \succsim_i h(e_i', e_{-i}) \quad \text{for all } e_i' \in E_i.$$

Since this is true for any $(e_i', e_{-i})$, it is a dominant strategy equilibrium. The proof is 
completed. 

Thus, we cannot get any new results if one insists on the choice of revelation mechanisms. 
To get more satisfactory results, one must give up the revelation mechanism, and 
look for a more general mechanism with general message spaces. 

Notice that, when one adopts the Nash equilibrium as a solution concept, the weak 
Nash implementation may not be a useful requirement. To see this, consider any social 
choice correspondence $F$ and the following mechanism: each individual's message space 
consists of the set of economic environments, i.e., it is given by $M_i = E$. The outcome 
function is defined as $h(m) = a \in F(e)$ when all agents report the same economic environment 
$m_i = e$, and otherwise the agents are seriously punished by being given a worse outcome. Then, 
it is clear that truth-telling is a Nash equilibrium. However, the mechanism has a lot of Nash 
equilibria, in fact an infinite number of them: any unanimous false report about the economic 
environment, $m_i = e'$, is also a Nash equilibrium. So, when we use Nash equilibrium as a 
solution concept, we need a social choice rule to be implemented or fully implemented in 
Nash equilibrium. 
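
The multiplicity problem can be seen by brute force in a tiny finite instance. The sketch below is only an illustration: the two environments, three agents, outcome labels and utility numbers are all hypothetical, chosen so that every agent ranks the punishment outcome `p` last. With a finite set of environments the number of equilibria is of course finite rather than infinite.

```python
from itertools import product

ENVIRONMENTS = ["e1", "e2"]
F = {"e1": "a1", "e2": "a2"}  # hypothetical social choice function
PUNISHMENT = "p"
# UTILITY[environment][agent][outcome]; p is the worst outcome for everyone.
UTILITY = {
    "e1": [{"a1": 2, "a2": 1, "p": 0}] * 3,
    "e2": [{"a1": 1, "a2": 2, "p": 0}] * 3,
}


def h(m):
    # Unanimous report e gives F(e); any disagreement triggers the punishment.
    return F[m[0]] if len(set(m)) == 1 else PUNISHMENT


def nash_equilibria(true_env):
    n = len(UTILITY[true_env])
    equilibria = []
    for m in product(ENVIRONMENTS, repeat=n):

        def gain(i, deviation):
            # Utility change for agent i from a unilateral deviation.
            m_dev = m[:i] + (deviation,) + m[i + 1:]
            u = UTILITY[true_env][i]
            return u[h(m_dev)] - u[h(m)]

        if all(gain(i, dev) <= 0 for i in range(n) for dev in ENVIRONMENTS):
            equilibria.append(m)
    return equilibria


print(nash_equilibria("e1"))
```

When the true environment is `e1`, both unanimous profiles are equilibria, and the unanimous false report yields `a2`, which is not in $F(e_1)$.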

15.8.2 Characterization of Nash Implementation 

Now we discuss what kind of social choice rules can be implemented through Nash in- 
centive compatible mechanism. Maskin in 1977 gave necessary and sufficient conditions 
for a social choice rule to be Nash implementable (This paper was not published till 1999 
due to the incorrectness of the original proof. It then appeared in Review of Economic 
Studies, 1999). Maskin's result is fundamental since it not only helps us to understand 
what kind of social choice correspondence can be Nash implemented, but also gives basic 
techniques and methods for studying if a social choice rule is implementable under other 
solution concepts. 

Maskin's monotonicity condition can be stated in two different ways although they 
are equivalent. 

Definition 15.8.1 (Maskin's Monotonicity) A social choice correspondence $F : E \rightrightarrows A$ 
is said to be Maskin's monotonic if for any $e, \bar{e} \in E$ and $x \in F(e)$ such that for all $i$ and 
all $y \in A$, $x \succsim_i y$ implies $x \,\bar{\succsim}_i\, y$, we have $x \in F(\bar{e})$. 

In words, Maskin's monotonicity requires that if an outcome $x$ is socially optimal 
with respect to economy $e$ and the economy is then changed to $\bar{e}$ so that in each individual's 
ordering, $x$ does not fall below any outcome that it was not below before, then $x$ remains 
socially optimal with respect to $\bar{e}$. 

Definition 15.8.2 (Another Version of Maskin's Monotonicity) An equivalent condition 
for a social choice correspondence $F : E \rightrightarrows A$ to be Maskin's monotonic is that, 
if for any two economic environments $e, \bar{e} \in E$ and $x \in F(e)$ such that $x \notin F(\bar{e})$, there is an 
agent $i$ and another $y \in A$ such that $x \succsim_i y$ and $y \,\bar{\succ}_i\, x$. 

Figure 15.3: An illustration of Maskin's monotonicity. 

Maskin's monotonicity is a reasonable condition, and many well known social choice 
rules satisfy this condition. 

Example 15.8.1 (Weak Pareto Efficiency) The weak Pareto optimal correspondence 
$P_w : E \rightrightarrows A$ is Maskin's monotonic. 

Proof. If $x \in P_w(e)$, then for all $y \in A$, there exists $i \in N$ such that $x \succsim_i y$. Now if 
for any $j \in N$, $x \succsim_j y$ implies $x \,\bar{\succsim}_j\, y$, then we have $x \,\bar{\succsim}_i\, y$ for that particular $i$. Thus, 
$x \in P_w(\bar{e})$. 

Example 15.8.2 (Majority Rule) The majority rule, or the Condorcet correspondence 
$CON : E \rightrightarrows A$ for strict preference profiles, which is defined by 

$$CON(e) = \{x \in A : \#\{i \mid x \succ_i y\} \geq \#\{i \mid y \succ_i x\} \text{ for all } y \in A\},$$

is Maskin's monotonic. 

Proof. If $x \in CON(e)$, then for all $y \in A$, 

$$\#\{i \mid x \succ_i y\} \geq \#\{i \mid y \succ_i x\}. \tag{15.26}$$

But if $\bar{e}$ is an economy such that, for all $i$, $x \succ_i y$ implies $x \,\bar{\succ}_i\, y$, then the left-hand side 
of (15.26) cannot fall when we replace $e$ by $\bar{e}$. Furthermore, if the right-hand side rises, 
then we must have $x \succ_i y$ and $y \,\bar{\succ}_i\, x$ for some $i$, a contradiction of the relation between $e$ 
and $\bar{e}$, given the strictness of preferences. So $x$ is still a majority winner with respect to 
$\bar{e}$, i.e., $x \in CON(\bar{e})$. 

In addition to the above two examples, the Walrasian correspondence and the Lindahl 
correspondence with interior allocations are Maskin's monotonic. Social choice rules defined 
on the class of preferences that satisfy the "single-crossing" property, or on individuals' 
preferences over lotteries that satisfy the von Neumann-Morgenstern axioms, also 
automatically satisfy Maskin's monotonicity. 

The following theorem shows that Maskin's monotonicity is a necessary condition for 
Nash-implementability. 

Theorem 15.8.1 For a social choice correspondence $F : E \rightrightarrows A$, if it is fully Nash 
implementable, then it must be Maskin's monotonic. 

Proof. For any two economies $e, \bar{e} \in E$ and $x \in F(e)$, by full Nash implementability 
of $F$, there is $m \in M$ such that $m$ is a Nash equilibrium and $x = h(m)$. This means that 
$h(m) \succsim_i h(m_i', m_{-i})$ for all $i$ and $m_i' \in M_i$. Given that $x \succsim_i y$ implies $x \,\bar{\succsim}_i\, y$, we have 
$h(m) \,\bar{\succsim}_i\, h(m_i', m_{-i})$, which means that $m$ is also a Nash equilibrium at $\bar{e}$. Thus, by Nash 
implementability again, we have $x \in F(\bar{e})$. 

Maskin's monotonicity itself cannot guarantee that a social choice correspondence is fully 
Nash implementable. However, under the so-called no-veto power, it becomes sufficient. 

Definition 15.8.3 (No-Veto Power) A social choice correspondence $F : E \rightrightarrows A$ is 
said to satisfy no-veto power if, whenever for some $i$ and $e$ we have $x \succsim_j y$ for all $y \in A$ 
and all $j \neq i$, then $x \in F(e)$. 

The no-veto power (NVP) condition implies that if $n - 1$ agents regard an outcome 
as the best for them, then it is socially optimal. This is a rather weak condition. NVP is 
satisfied by virtually all "standard" social choice rules, including the weak Pareto efficient and 
Condorcet correspondences. It is also often automatically satisfied by any social choice 
rule when preferences are restricted. For example, for private goods economies with 
at least three agents, if each agent's utility function is strictly monotonic, then there is no 
allocation such that $n - 1$ agents regard it as best, so the no-veto power condition 
holds. The following theorem is given by Maskin (1977, 1999), but a complete proof of 
the theorem was due to Repullo (1987). 

Theorem 15.8.2 Under no-veto power, if Maskin's monotonicity condition is satisfied, 
then $F$ is fully Nash implementable. 



Proof. The proof is by construction. For each agent $i$, his message space is defined by 

$$M_i = E \times A \times \mathcal{N},$$

where $\mathcal{N} = \{1, 2, \ldots\}$. Its elements are denoted by $m_i = (e^i, a^i, v^i)$, which means 
each agent $i$ announces an economic profile, an outcome, and a positive integer. Notice 
that we have used $e^i$ and $a^i$ to denote the economic profile of all individuals' economic 
characteristics and the outcome announced by individual $i$, but not just agent $i$'s economic 
characteristic and outcome. 

The outcome function is constructed in three rules: 

Rule (1). If $m_1 = m_2 = \cdots = m_n = (e, a, v)$ and $a \in F(e)$, the outcome function is 
defined by 

$$h(m) = a.$$

In words, if players are unanimous in their strategy, and their proposed alternative $a$ is 
$F$-optimal, given their proposed profile $e$, the outcome is $a$. 

Rule (2). For all $j \neq i$, $m_j = (e, a, v)$, $m_i = (e^i, a^i, v^i) \neq (e, a, v)$, and $a \in F(e)$, define: 

$$h(m) = \begin{cases} a^i & \text{if } a^i \in L(a, e_i) \\ a & \text{if } a^i \notin L(a, e_i), \end{cases}$$

where $L(a, e_i) = \{b \in A : a \, R_i \, b\}$ is the lower contour set of $R_i$ at $a$. That is, 
suppose that all players but one play the same strategy and, given their proposed profile, 
their proposed alternative $a$ is $F$-optimal. Then, the odd-man-out gets his proposed 
alternative, provided that it is in the lower contour set at $a$ of the ordering that the other 
players propose for him; otherwise, the outcome is $a$. 

Rule (3). If neither Rule (1) nor Rule (2) applies, then define 

$$h(m) = a^{i^*},$$

where $i^* = \max\{i \in N : v^i = \max_j v^j\}$. In other words, when neither Rule (1) nor 
Rule (2) applies, the outcome is the alternative proposed by the player with the highest index 
among those whose proposed number is maximal. 

Now we show that the mechanism $\langle M, h \rangle$ defined above fully Nash implements the social 
choice correspondence $F$, i.e., $h(N(e)) = F(e)$ for all $e \in E$. We first show that $F(e) \subseteq 
h(N(e))$ for all $e \in E$, i.e., we need to show that for all $e \in E$ and $a \in F(e)$, there exists 
an $m \in M$ such that $a = h(m)$ is a Nash equilibrium outcome. To do so, we only need to 
show that any $m$ which is given by Rule (1) is a Nash equilibrium. Note that $h(m) = a$ 
and for any given $m_i' = (e'^i, a'^i, v'^i) \neq m_i$, by Rule (2), we have 

$$h(m_i', m_{-i}) = \begin{cases} a'^i & \text{if } a'^i \in L(a, e_i) \\ a & \text{if } a'^i \notin L(a, e_i), \end{cases}$$

and thus 

$$h(m) \, R_i \, h(m_i', m_{-i}) \quad \forall m_i' \in M_i.$$

Hence, $m$ is a Nash equilibrium. 

We now show that for each economic environment $e \in E$, if $m$ is a Nash equilibrium, 
then $h(m) \in F(e)$. First, consider the case in which the Nash equilibrium $m$ is given by Rule (1) so that 
$a \in F(e)$, but the true economic profile is $e'$, i.e., $m \in NE(e', \Gamma)$. We need to show 
$a \in F(e')$. By Rule (1), $h(m) = a$. Let $b \in L(a, e_i)$, so the new message for $i$ is 
$m_i' = (e, b, v^i)$. Then $h(m_i', m_{-i}) = b$ by Rule (2). Now, since $a$ is a Nash equilibrium 
outcome with respect to $e'$, we have $a = h(m) \, R_i' \, h(m_i', m_{-i}) = b$. Thus, we have shown 
that for all $i \in N$ and $b \in A$, $a \, R_i \, b$ implies $a \, R_i' \, b$. Thus, by Maskin's monotonicity 
condition, we have $a \in F(e')$. 

Next, suppose the Nash equilibrium $m$ for $e'$ falls under Rule (2), i.e., for all $j \neq i$, $m_j = 
(e, a, v)$ and $m_i \neq (e, a, v)$. Let $a' = h(m)$. By Rule (3), each $j \neq i$ can induce any outcome 
$b \in A$ by choosing $m_j' = (R^j, b, v^j)$ with a sufficiently large $v^j$ (which is greater than $\max_{k \neq j} v^k$), 
as the outcome at $(m_j', m_{-j})$, i.e., $b = h(m_j', m_{-j})$. Hence, $m$ being a Nash equilibrium with 
respect to $e'$ implies that for all $j \neq i$ and all $b \in A$, we have 

$$a' \, R_j' \, b.$$

Thus, by the no-veto power assumption, we have $a' \in F(e')$. 

By the same argument as above, if the Nash equilibrium $m$ for $e'$ is given by Rule 
(3), we have $a' \in F(e')$. The proof is completed. 

Although Maskin's monotonicity is very weak, it is violated by some social choice rules 
such as Solomon's famous judgement. Solomon's solution falls under Nash equilibrium 
implementation, since each woman knows who is the real mother. His solution, which 
consisted in threatening to cut the baby in two, is not entirely foolproof. What would 
he have done if the impostor had had the presence of mind to scream like a real mother? 
Solomon's problem can be formally described by the language of mechanism design as 
follows. 

Two women: Anne and Bets 

Two economies (states): $E = \{\alpha, \beta\}$, where 

$\alpha$: Anne is the real mother 
$\beta$: Bets is the real mother 

Solomon has three alternatives so that the feasible set is given by $A = \{a, b, c\}$, 
where 

a: give the baby to Anne 
b: give the baby to Bets 
c: cut the baby in half. 

Solomon's desirability (social goal) is to give the baby to the real mother, 

$f(\alpha) = a$ if $\alpha$ happens 
$f(\beta) = b$ if $\beta$ happens 

Preferences of Anne and Bets: 

For Anne, 

at state $\alpha$: $a \succ_A^{\alpha} b \succ_A^{\alpha} c$ 
at state $\beta$: $a \succ_A^{\beta} c \succ_A^{\beta} b$ 

For Bets, 

at state $\alpha$: $b \succ_B^{\alpha} c \succ_B^{\alpha} a$ 
at state $\beta$: $b \succ_B^{\beta} a \succ_B^{\beta} c$ 

To see that Solomon's solution does not work, we only need to show his social choice goal is 
not Maskin's monotonic. Notice that for Anne, 

$a \succ_A^{\alpha} b, c$, 

$a \succ_A^{\beta} b, c$, 

while for Bets, $a$ is the worst alternative at state $\alpha$, so $a$ cannot fall in her ranking when 
the state changes to $\beta$. Since $f(\alpha) = a$, by Maskin's monotonicity we should have $f(\beta) = a$, but we actually have 
$f(\beta) = b$. So Solomon's social choice goal is not Nash implementable. 
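
The violation can also be confirmed mechanically by comparing lower contour sets across the two states. The sketch below encodes the rankings listed above; the helper names `lower_contour` and `monotonicity_violations` are illustrative.

```python
# Rankings listed from best to worst in each state.
PREFS = {
    "alpha": {"Anne": ["a", "b", "c"], "Bets": ["b", "c", "a"]},
    "beta": {"Anne": ["a", "c", "b"], "Bets": ["b", "a", "c"]},
}
F = {"alpha": {"a"}, "beta": {"b"}}  # Solomon's social goal


def lower_contour(ranking, x):
    # Alternatives ranked no higher than x (x included).
    return set(ranking[ranking.index(x):])


def monotonicity_violations(prefs, f):
    """Triples (e, e_bar, x) with x in f(e), x not in f(e_bar), although x has
    not fallen in anyone's ranking when moving from e to e_bar."""
    violations = []
    for e in prefs:
        for e_bar in prefs:
            for x in f[e]:
                nested = all(
                    lower_contour(prefs[e][i], x)
                    <= lower_contour(prefs[e_bar][i], x)
                    for i in prefs[e]
                )
                if nested and x not in f[e_bar]:
                    violations.append((e, e_bar, x))
    return violations


print(monotonicity_violations(PREFS, F))
```

The check reports a violation in both directions: alternative a when moving from $\alpha$ to $\beta$, and alternative b when moving from $\beta$ to $\alpha$.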

15.9 Better Mechanism Design 

Maskin's theorem gives necessary and sufficient conditions for a social choice correspon- 
dence to be Nash implementable. However, due to the general nature of the social choice 
rules under consideration, the implementing mechanisms in proving characterization the- 
orems turn out to be quite complex. Characterization results show what is possible for 
the implementation of a social choice rule, but not what is realistic. Thus, like most char- 
acterization results in the literature, Maskin's mechanism is not natural in the sense that 
it is not continuous; small variations in an agent's strategy choice may lead to large jumps 
in the resulting allocations, and further it has a message space of infinite dimension since 
it includes preference profiles as a component. In this section, we give some mechanisms 
that have some desired properties. 

15.9.1 Groves-Ledyard Mechanism 

Groves and Ledyard (1977, Econometrica) were the first to give a specific mechanism 
that Nash implements Pareto efficient allocations for public goods economies. 

To show the basic structure of the Groves-Ledyard mechanism, consider a simplified 
Groves-Ledyard mechanism. Public goods economies under consideration have one private 
good $x_i$, one public good $y$, and three agents ($n = 3$). The production function is given 
by $y = v$. 

The mechanism is defined as follows: 

$M_i = \mathbb{R}$, $i = 1, 2, 3$. Its elements, $m_i$, can be interpreted as the proposed 
contribution (or tax) that agent $i$ is willing to make. 

$t_i(m) = m_i^2 + 2 m_j m_k$, where $j$ and $k$ are the other two agents: the actual contribution 
$t_i$ determined by the mechanism with the reported $m_i$. 

$y(m) = (m_1 + m_2 + m_3)^2$: the level of public good $y$. 

$x_i(m) = w_i - t_i(m)$: the consumption of the private good. 

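Then the mechanism is balanced since 

$$\sum_{i=1}^{3} x_i + \sum_{i=1}^{3} t_i(m) = \sum_{i=1}^{3} x_i + (m_1 + m_2 + m_3)^2 = \sum_{i=1}^{3} x_i + y = \sum_{i=1}^{3} w_i.$$

The payoff function is given by 

$$v_i(m) = u_i(x_i(m), y(m)) = u_i(w_i - t_i(m), y(m)).$$

To find a Nash equilibrium, we set 

$$\frac{\partial v_i(m)}{\partial m_i} = 0. \tag{15.27}$$

Then, 

$$\frac{\partial v_i(m)}{\partial m_i} = \frac{\partial u_i}{\partial x_i}(-2 m_i) + \frac{\partial u_i}{\partial y} \, 2(m_1 + m_2 + m_3) = 0, \tag{15.28}$$

and thus 

$$\frac{\partial u_i / \partial y}{\partial u_i / \partial x_i} = \frac{m_i}{m_1 + m_2 + m_3}. \tag{15.29}$$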



When Ui are quasi-concave, the first order condition will be a sufficient condition for Nash 
equilibrium. 

Making summation, we have at Nash equilibrium 

$$\sum_{i=1}^{3} \frac{\partial u_i / \partial y}{\partial u_i / \partial x_i} = \sum_{i=1}^{3} \frac{m_i}{m_1 + m_2 + m_3} = 1 = \frac{1}{f'(v)}, \tag{15.30}$$

that is, 

$$\sum_{i=1}^{3} MRS_{y x_i} = MRTS_{yv}.$$

Thus, the Lindahl-Samuelson and balanced conditions hold, which means every Nash 
equilibrium allocation is Pareto efficient. 
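
The balance identity $\sum_i t_i(m) = (m_1 + m_2 + m_3)^2 = y(m)$ of the simplified mechanism is easy to verify numerically. The following is a minimal sketch; the endowments and messages are hypothetical numbers chosen for illustration.

```python
def contributions(m):
    # t_i(m) = m_i**2 + 2 * m_j * m_k, with j and k the other two agents.
    m1, m2, m3 = m
    return [m1**2 + 2 * m2 * m3, m2**2 + 2 * m1 * m3, m3**2 + 2 * m1 * m2]


def public_good(m):
    # y(m) = (m_1 + m_2 + m_3)**2
    return sum(m) ** 2


def private_consumption(w, m):
    # x_i(m) = w_i - t_i(m)
    return [wi - ti for wi, ti in zip(w, contributions(m))]


w = (5, 5, 5)   # hypothetical endowments
m = (1, -2, 4)  # hypothetical messages
x = private_consumption(w, m)
print(contributions(m), public_good(m), x)
# Balance: total private consumption plus the public good exhausts endowments.
print(sum(x) + public_good(m) == sum(w))
```

With these numbers two agents end up with negative private consumption, so the outcome is balanced but not individually feasible.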

They claimed that they had solved the free-rider problem in the presence of public 
goods. However, economic issues are usually complicated. Some agreed that they had indeed 
solved the problem, some did not. There are two weaknesses of the Groves-Ledyard mechanism: 
(1) it is not individually rational: the payoff at a Nash equilibrium may be lower than at 
the initial endowment, and (2) it is not individually feasible: $x_i(m) = w_i - t_i(m)$ may be 
negative. 

How can we design the incentive mechanism to pursue Pareto efficient and individually 
rational allocations? 

15.9.2 Walker's Mechanism 

Walker (1981, Econometrica) gave such a mechanism. Again, consider public goods 
economies with $n$ agents, one private good, and one public good, and the production 
function is given by $y = f(v) = v$. We assume $n \geq 3$. 

The mechanism is defined by: 

$M_i = \mathbb{R}$ 

$y(m) = \sum_{i=1}^{n} m_i$: the level of public good. 

$q_i(m) = \frac{1}{n} + m_{i+1} - m_{i+2}$ (indices taken modulo $n$): personalized price of public good. 

$t_i(m) = q_i(m) y(m)$: the contribution (tax) made by agent $i$. 

$x_i(m) = w_i - t_i(m) = w_i - q_i(m) y(m)$: the private good consumption. 

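Then, the budget constraint holds: 

$$x_i(m) + q_i(m) y(m) = w_i \quad \forall m_i \in M_i. \tag{15.31}$$

Making summation, we have 

$$\sum_{i=1}^{n} x_i(m) + \sum_{i=1}^{n} q_i(m) y(m) = \sum_{i=1}^{n} w_i,$$

and thus, since $\sum_{i=1}^{n} q_i(m) = 1$, 

$$\sum_{i=1}^{n} x_i(m) + y(m) = \sum_{i=1}^{n} w_i,$$

which means the mechanism is balanced. 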
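The payoff function is 

$$v_i(m) = u_i(x_i, y) = u_i(w_i - q_i(m) y(m), y(m)).$$

The first order condition for interior allocations is given by 

$$\frac{\partial v_i}{\partial m_i} = -\frac{\partial u_i}{\partial x_i}\Big[\frac{\partial q_i}{\partial m_i} y(m) + q_i(m) \frac{\partial y(m)}{\partial m_i}\Big] + \frac{\partial u_i}{\partial y} \frac{\partial y(m)}{\partial m_i} = -\frac{\partial u_i}{\partial x_i} q_i(m) + \frac{\partial u_i}{\partial y} = 0,$$

where we have used $\partial q_i / \partial m_i = 0$ and $\partial y(m) / \partial m_i = 1$. Hence 

$$\frac{\partial u_i / \partial y}{\partial u_i / \partial x_i} = q_i(m) \quad \text{(FOC) for the Lindahl allocation,}$$

which gives 

$$N(e) \subseteq L(e).$$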

Thus, if the $u_i$ are quasi-concave, it is also a sufficient condition for Lindahl equilibrium. We 
can also show every Lindahl allocation is a Nash equilibrium allocation, i.e., 

$$L(e) \subseteq N(e). \tag{15.32}$$

Indeed, suppose $[(x^*, y^*), q_1^*, \ldots, q_n^*]$ is a Lindahl equilibrium. Let $m^*$ be the solution of 
the following equations: 

$$q_i^* = \frac{1}{n} + m_{i+1} - m_{i+2}, \qquad y^* = \sum_{i=1}^{n} m_i.$$

Then we have $x_i(m^*) = x_i^*$, $y(m^*) = y^*$ and $q_i(m^*) = q_i^*$ for all $i \in N$. Thus from 
$(x_i(m_i, m^*_{-i}), y(m_i, m^*_{-i})) \in \mathbb{R}^2_+$ and $x_i(m_i, m^*_{-i}) + q_i(m^*) y(m_i, m^*_{-i}) = w_i$ for all $i \in N$ 
and $m_i \in M_i$, we have $(x_i(m^*), y(m^*)) \, R_i \, (x_i(m_i, m^*_{-i}), y(m_i, m^*_{-i}))$, which means that 
$m^*$ is a Nash equilibrium. 
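
The linear system that recovers the messages from a Lindahl price vector and a public good level can be solved by chaining the differences $m_{i+1} - m_{i+2} = q_i^* - 1/n$ and then shifting all messages so that they sum to $y^*$. The sketch below assumes zero-based indices taken modulo $n$ and uses hypothetical prices; `messages_for` is an illustrative helper name.

```python
from fractions import Fraction


def prices(m):
    # q_i(m) = 1/n + m_{i+1} - m_{i+2}, indices taken modulo n.
    n = len(m)
    return [Fraction(1, n) + m[(i + 1) % n] - m[(i + 2) % n] for i in range(n)]


def public_good(m):
    # y(m) = sum of the messages.
    return sum(m)


def messages_for(q_star, y_star):
    """Solve q_i* = 1/n + m_{i+1} - m_{i+2}, sum_i m_i = y* (needs sum q* = 1)."""
    n = len(q_star)
    offsets = [Fraction(0)] * n  # messages measured relative to m[1]
    for i in range(n - 1):
        offsets[(i + 2) % n] = offsets[(i + 1) % n] - (q_star[i] - Fraction(1, n))
    shift = (y_star - sum(offsets)) / n
    return [o + shift for o in offsets]


q_star = [Fraction(1, 2), Fraction(1, 3), Fraction(1, 6)]  # hypothetical Lindahl prices
m_star = messages_for(q_star, 6)
print(m_star, prices(m_star), public_good(m_star))
```

Personalized prices sum to one for any message profile, which is what makes the mechanism balanced.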

Thus, Walker's mechanism fully implements Lindahl allocations, which are Pareto efficient 
and individually rational. 

Walker's mechanism also has a disadvantage: it is not feasible, although it does solve 
the individual rationality problem. If a person claims large amounts of $t_i$, consumption 
of the private good may be negative, i.e., $x_i = w_i - t_i < 0$. Tian proposed a mechanism that 
overcomes this problem of Walker's mechanism. Tian's mechanism is individually feasible, 
balanced, and continuous. 

15.9.3 Tian's Mechanism 

In Tian's mechanism (JET, 1991), everything is the same as Walker's, except that $y(m)$ 
is given by 

$$y(m) = \begin{cases} a(m) & \text{if } \sum_{i=1}^{n} m_i > a(m) \\ \sum_{i=1}^{n} m_i & \text{if } 0 \leq \sum_{i=1}^{n} m_i \leq a(m) \\ 0 & \text{if } \sum_{i=1}^{n} m_i < 0, \end{cases} \tag{15.33}$$

where $a(m) = \min_{i \in N'(m)} \frac{w_i}{q_i(m)}$ can be regarded as the feasible upper bound for having 
a feasible allocation. Here $N'(m) = \{i \in N : q_i(m) > 0\}$. 

An interpretation of this formulation is that if the total contributions that the agents 
are willing to pay were between zero and the feasible upper bound, the level of public 
good to be produced would be exactly the total taxes; if the total taxes were less than 
zero, no public good would be produced; if the total taxes exceeded the feasible upper 
bound, the level of the public good would be equal to the feasible upper bound. 
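
The truncated outcome function can be sketched in a few lines. The code below assumes Walker-style personalized prices $q_i(m) = 1/n + m_{i+1} - m_{i+2}$ with indices taken modulo $n$ and private consumption $x_i = w_i - q_i(m) y(m)$; the endowments and messages are hypothetical.

```python
from fractions import Fraction


def prices(m):
    # q_i(m) = 1/n + m_{i+1} - m_{i+2}, indices taken modulo n.
    n = len(m)
    return [Fraction(1, n) + m[(i + 1) % n] - m[(i + 2) % n] for i in range(n)]


def feasible_upper_bound(w, m):
    # a(m) = min over agents with a positive price of w_i / q_i(m).
    # The prices sum to one, so at least one of them is positive.
    return min(Fraction(wi) / qi for wi, qi in zip(w, prices(m)) if qi > 0)


def public_good(w, m):
    # Total contributions truncated to the interval [0, a(m)].
    return max(0, min(sum(m), feasible_upper_bound(w, m)))


def private_consumption(w, m):
    # x_i(m) = w_i - q_i(m) * y(m)
    y = public_good(w, m)
    return [wi - qi * y for wi, qi in zip(w, prices(m))]


w = [3, 3, 3]
for m in ([10, 10, 10], [-1, -1, -1], [1, 2, 0]):
    print(m, public_good(w, m), private_consumption(w, m))
```

In each case private consumption is nonnegative and total consumption equals total endowment, which is the feasibility property the truncation is designed to deliver.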




Figure 15.4: The feasible public good outcome function $Y(m)$. 



To show this mechanism has all the nice properties, we need to assume that preferences 
are strictly monotonically increasing and convex, and further assume that every interior 
allocation is preferred to any boundary allocations: For all $i \in N$, $(x_i, y) \, P_i \, (x_i', y')$ for all 
$(x_i, y) \in \mathbb{R}^2_{++}$ and $(x_i', y') \in \partial \mathbb{R}^2_+$, where $\partial \mathbb{R}^2_+$ is the boundary of $\mathbb{R}^2_+$. 

To show the equivalence between Nash allocations and Lindahl allocations, we first 
prove the following lemmas. 

Lemma 15.9.1 If $(x(m^*), y(m^*)) \in N_{M,h}(e)$, then $(x_i(m^*), y(m^*)) \in \mathbb{R}^2_{++}$ for all $i \in N$. 

Proof: We argue by contradiction. Suppose $(x_i(m^*), y(m^*)) \in \partial \mathbb{R}^2_+$. Then $x_i(m^*) = 0$ 
for some $i \in N$ or $y(m^*) = 0$. Consider the quadratic equation 

$$y = \frac{w^*}{2(y + c)}, \tag{15.34}$$

where $w^* = \min_{i \in N} w_i$ and $c = b + n \sum_{i=1}^{n} |m_i^*|$, with $b = 1/n$. The larger root of (15.34) 
is positive and denoted by $\tilde{y}$. Suppose that player $i$ chooses his/her message $m_i = \tilde{y} - 
\sum_{j \neq i} m_j^*$. Then $\tilde{y} = m_i + \sum_{j \neq i} m_j^* > 0$ and 

$$w_j - q_j(m_i, m^*_{-i}) \tilde{y} \geq w_j - \Big[b + \Big(n \sum_{s=1}^{n} |m_s^*| + \tilde{y}\Big)\Big] \tilde{y} = w_j - \Big(\tilde{y} + b + n \sum_{s=1}^{n} |m_s^*|\Big) \tilde{y} = w_j - w^*/2 \geq w_j/2 > 0 \tag{15.35}$$

for all $j \in N$. Thus, $y(m_i, m^*_{-i}) = \tilde{y} > 0$ and $x_j(m_i, m^*_{-i}) = w_j - q_j(m_i, m^*_{-i}) y(m_i, m^*_{-i}) = 
w_j - q_j(m_i, m^*_{-i}) \tilde{y} > 0$ for all $j \in N$. Thus $(x_i(m_i, m^*_{-i}), y(m_i, m^*_{-i})) \, P_i \, (x_i(m^*), y(m^*))$ by 
the assumption that every interior allocation is preferred to any boundary allocation, which contradicts 
the hypothesis $(x(m^*), y(m^*)) \in N_{M,h}(e)$. Q.E.D. 

Lemma 15.9.2 If $(x(m^*), y(m^*)) \in N_{M,h}(e)$, then $y(m^*)$ is an interior point of $[0, a(m^*)]$ 
and thus $y(m^*) = \sum_{i=1}^{n} m_i^*$. 

Proof: By Lemma 15.9.1, $y(m^*) > 0$. So we only need to show $y(m^*) < a(m^*)$. Suppose, 
by way of contradiction, that $y(m^*) = a(m^*)$. Then $x_j(m^*) = w_j - q_j(m^*) y(m^*) = 
w_j - q_j(m^*) a(m^*) = w_j - w_j = 0$ for at least some $j \in N$. But $x(m^*) > 0$ by Lemma 
15.9.1, a contradiction. Q.E.D. 

Proposition 15.9.1 If the mechanism has a Nash equilibrium $m^*$, then $(x(m^*), y(m^*))$ is 
a Lindahl allocation with $(q_1(m^*), \ldots, q_n(m^*))$ as the Lindahl price vector, i.e., $N_{M,h}(e) \subseteq 
L(e)$. 

Proof: Let $m^*$ be a Nash equilibrium. Now we prove that $(x(m^*), y(m^*))$ is a Lindahl 
allocation with $(q_1(m^*), \ldots, q_n(m^*))$ as the Lindahl price vector. Since the mechanism is 
feasible and $\sum_{i=1}^{n} q_i(m^*) = 1$ as well as $x_i(m^*) + q_i(m^*) y(m^*) = w_i$ for all $i \in N$, we only 
need to show that each individual is maximizing his/her preference. Suppose, by way of 
contradiction, that there is some $(x_i, y) \in \mathbb{R}^2_+$ such that $(x_i, y) \, P_i \, (x_i(m^*), y(m^*))$ and 
$x_i + q_i(m^*) y \leq w_i$. Because of monotonicity of preferences, it will be enough to confine 
ourselves to the case of $x_i + q_i(m^*) y = w_i$. Let $(x_{i\lambda}, y_\lambda) = (\lambda x_i + (1 - \lambda) x_i(m^*), \lambda y + (1 - 
\lambda) y(m^*))$. Then by convexity of preferences we have $(x_{i\lambda}, y_\lambda) \, P_i \, (x_i(m^*), y(m^*))$ for any 
$0 < \lambda < 1$. Also $(x_{i\lambda}, y_\lambda) \in \mathbb{R}^2_+$ and $x_{i\lambda} + q_i(m^*) y_\lambda = w_i$. 

Suppose that player $i$ chooses his/her message $m_i = y_\lambda - \sum_{j \neq i} m_j^*$. Since $y(m^*) = 
\sum_{j=1}^{n} m_j^*$ by Lemma 15.9.2, $m_i = y_\lambda - y(m^*) + m_i^*$. Thus as $\lambda \to 0$, $y_\lambda \to y(m^*)$, and 
therefore $m_i \to m_i^*$. Since $x_j(m^*) = w_j - q_j(m^*) y(m^*) > 0$ for all $j \in N$ by Lemma 15.9.1, 
we have $w_j - q_j(m_i, m^*_{-i}) y_\lambda > 0$ for all $j \in N$ when $\lambda$ is a sufficiently small positive number. 
Therefore, $y(m_i, m^*_{-i}) = y_\lambda$ and $x_i(m_i, m^*_{-i}) = w_i - q_i(m^*) y(m_i, m^*_{-i}) = w_i - q_i(m^*) y_\lambda = 
x_{i\lambda}$. From $(x_{i\lambda}, y_\lambda) \, P_i \, (x_i(m^*), y(m^*))$, we have $(x_i(m_i, m^*_{-i}), y(m_i, m^*_{-i})) \, P_i \, (x_i(m^*), y(m^*))$. 
This contradicts $(x(m^*), y(m^*)) \in N_{M,h}(e)$. Q.E.D. 

Proposition 15.9.2 If $(x^*, y^*)$ is a Lindahl allocation with the Lindahl price vector $q^* = 
(q_1^*, \ldots, q_n^*)$, then there is a Nash equilibrium $m^*$ of the mechanism such that $x_i(m^*) = x_i^*$, 
$q_i(m^*) = q_i^*$, for all $i \in N$, $y(m^*) = y^*$, i.e., $L(e) \subseteq N_{M,h}(e)$. 

Proof: We need to show that there is a message $m^*$ such that $(x^*, y^*)$ is a Nash 
allocation. Let $m^*$ be the solution of the following linear equations system: 

$$q_i^* = \frac{1}{n} + m_{i+1} - m_{i+2}, \qquad y^* = \sum_{i=1}^{n} m_i.$$

Then we have $x_i(m^*) = x_i^*$, $y(m^*) = y^*$ and $q_i(m^*) = q_i^*$ for all $i \in N$. Thus from 
$(x_i(m_i, m^*_{-i}), y(m_i, m^*_{-i})) \in \mathbb{R}^2_+$ and $x_i(m_i, m^*_{-i}) + q_i(m^*) y(m_i, m^*_{-i}) = w_i$ for all $i \in N$ 
and $m_i \in M_i$, we have $(x_i(m^*), y(m^*)) \, R_i \, (x_i(m_i, m^*_{-i}), y(m_i, m^*_{-i}))$. Q.E.D. 

Thus, Tian's mechanism Nash implements Lindahl allocations. 

15.10 Incomplete Information and Bayesian-Nash Implementation 



15.10.1 Bayesian-Nash Implementation Problem 

Nash implementation imposes a strong informational requirement. Although 
the designer does not know the agents' characteristics, Nash equilibrium 
assumes each agent knows the characteristics of the others. This assumption is hardly 
satisfied in many cases in the real world. Can we remove this assumption? The answer is 
positive. One can use the Bayesian-Nash equilibrium, introduced by Harsanyi, as a solution 
concept to describe individuals' self-interested behavior. Although each agent does 
not know the economic characteristics of the others, he knows the probability distribution 
of the economic characteristics of the others. In this case, we can still design an incentive 
compatible mechanism. 

Throughout we follow the notation introduced in the beginning of this chapter. Also,
for simplicity, assume each individual's preferences are given by parametric utility
functions $u_i(x, \theta_i)$. Assume that all agents and the designer know that the vector of types,
$\theta = (\theta_1, \ldots, \theta_n)$, is distributed according to $\varphi(\theta)$ a priori on a set $\Theta$.

Each agent knows his own type $\theta_i$, and therefore computes the conditional distribution
of the types of the other agents:

$$\varphi(\theta_{-i} \mid \theta_i) = \frac{\varphi(\theta_i, \theta_{-i})}{\int_{\Theta_{-i}} \varphi(\theta_i, \theta_{-i}) \, d\theta_{-i}}.$$

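As usual, a mechanism is a pair $\Gamma = \langle M, h \rangle$. Given $m$ with $m_i : \Theta_i \to M_i$, agent $i$'s
expected utility at $\theta_i$ is given by

$$\Pi^i_{\langle M,h \rangle}(m(\theta); \theta_i) \equiv E_{\theta_{-i}}[u_i(h(m_i(\theta_i), m_{-i}(\theta_{-i})), \theta_i) \mid \theta_i]$$

$$= \int_{\Theta_{-i}} u_i(h(m(\theta)), \theta_i) \, \varphi(\theta_{-i} \mid \theta_i) \, d\theta_{-i}. \qquad (15.36)$$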

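A strategy $m^*(\cdot)$ is a Bayesian-Nash equilibrium (BNE) of $\langle M, h \rangle$ if for all $i$ and all $\theta_i \in \Theta_i$,

$$\Pi^i(m^*(\theta); \theta_i) \ge \Pi^i(m_i, m^*_{-i}(\theta_{-i}); \theta_i) \quad \forall m_i \in M_i.$$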

That is, if player $i$ believes that the other players are using strategies $m^*_{-i}(\cdot)$, then he
maximizes his expected utility by using strategy $m^*_i(\cdot)$. Denote by $B(\theta, \Gamma)$ the set of all
Bayesian-Nash equilibria of the mechanism.

Remark 15.10.1 In the present incomplete information setting, a message $m_i : \Theta_i \to M_i$
is a dominant strategy for agent $i$ in mechanism $\Gamma = \langle M, h \rangle$ if for all $\theta_i \in \Theta_i$
and all possible strategies $m_{-i}(\theta_{-i})$,

$$E_{\theta_{-i}}[u_i(h(m_i(\theta_i), m_{-i}(\theta_{-i})), \theta_i) \mid \theta_i] \ge E_{\theta_{-i}}[u_i(h(m_i', m_{-i}(\theta_{-i})), \theta_i) \mid \theta_i] \qquad (15.37)$$

for all $m_i' \in M_i$.

Since condition (15.37) holding for all $\theta_i$ and all $m_{-i}(\theta_{-i})$ is equivalent to the condition
that, for all $\theta_i \in \Theta_i$,

$$u_i(h(m_i(\theta_i), m_{-i}), \theta_i) \ge u_i(h(m_i', m_{-i}), \theta_i) \qquad (15.38)$$

for all $m_i' \in M_i$ and all $m_{-i} \in M_{-i}$ (see footnote 2), this leads to the following definition
of dominant strategy equilibrium.

A message $m^*$ is a dominant strategy equilibrium of a mechanism $\Gamma = \langle M, h \rangle$ if for
all $i$ and all $\theta_i \in \Theta_i$,

$$u_i(h(m_i^*(\theta_i), m_{-i}), \theta_i) \ge u_i(h(m_i', m_{-i}), \theta_i)$$

for all $m_i' \in M_i$ and all $m_{-i} \in M_{-i}$, which is the same as in the case of complete
information.

Remark 15.10.2 It is clear every dominant strategy equilibrium is a Bayesian-Nash 
equilibrium, but the converse may not be true. Bayesian-Nash equilibrium requires more 
sophistication from the agents than dominant strategy equilibrium. Each agent, in order 
to find his optimal strategy, must have a correct prior $\varphi(\cdot)$ over states, and must correctly
predict the equilibrium strategies used by other agents. 

Like Nash implementation, Bayesian-Nash incentive compatibility involves the relationship
between $F(\theta)$ and $B(\theta, \Gamma)$.

A mechanism $\Gamma = \langle M, h \rangle$ is said to Bayesian-Nash implement a social choice
correspondence $F$ on $\Theta$ if for every $\theta \in \Theta$,

(a) $B(\theta, \Gamma) \neq \emptyset$;

(b) $h(B(\theta, \Gamma)) \subseteq F(\theta)$.

It fully Bayesian-Nash implements a social choice correspondence $F$ on $\Theta$ if for every
$\theta \in \Theta$,

(a) $B(\theta, \Gamma) \neq \emptyset$;

(b) $h(B(\theta, \Gamma)) = F(\theta)$.

2 (15.37) implies (15.38) by noting $m_{-i}(\theta_{-i}) = m_{-i}$, and (15.38) implies (15.37) because
$E_{\theta_{-i}}[u_i(h(m_i', m_{-i}(\theta_{-i})), \theta_i) \mid \theta_i] = u_i(h(m_i', m_{-i}), \theta_i)$ for all $m_i' \in M_i$.

If for a given social choice correspondence F, there is some mechanism that Bayesian- 
Nash implements F, we call this social choice correspondence Bayesian-Nash imple- 
mentable. 

For a general social choice correspondence $F : \Theta \to Z$, Postlewaite-Schmeidler (JET,
1986), Palfrey-Srivastava (RES, 1989), Mookherjee-Reichelstein (RES, 1990), Jackson
(Econometrica, 1991), Dutta-Sen (Econometrica, 1994), and Tian (Social Choice and Welfare,
1996; Journal of Math. Econ., 1999) provided necessary and sufficient conditions for a
social choice correspondence $F$ to be Bayesian-Nash implementable. Under some technical
conditions, they showed that a social choice correspondence $F$ is Bayesian-Nash
implementable if and only if $F$ is Bayesian monotonic and Bayesian-Nash incentive compatible.

To see specifically what is implementable in BNE, in the remainder of this section
the social choice goal is given by a single-valued social choice function $f : \Theta \to Z$.
Furthermore, by the Revelation Principle below, without loss of generality, we can focus on
direct revelation mechanisms.

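Definition 15.10.1 Choice rule $f(\cdot)$ is truthfully implementable in BNE if for all $i \in I$
and $\theta_i \in \Theta_i$, $m_i^*(\theta_i) = \theta_i$ is a BNE of the direct revelation mechanism
$\Gamma = (\Theta_1, \ldots, \Theta_n, f(\cdot))$, i.e., $\forall i$, $\forall \theta_i \in \Theta_i$, we have

$$E_{\theta_{-i}}[u_i(f(\theta_i, \theta_{-i}), \theta_i) \mid \theta_i] \ge E_{\theta_{-i}}[u_i(f(\hat\theta_i, \theta_{-i}), \theta_i) \mid \theta_i], \quad \forall \hat\theta_i \in \Theta_i.$$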

The concept of Bayesian incentive compatibility means that every agent will report his
type truthfully provided all other agents are employing their truthful strategies; thus
the truthful strategy profile is a Bayesian-Nash equilibrium of the direct revelation
mechanism. Notice that Bayesian incentive compatibility does not say what the best response
of an agent is when other agents are not using truthful strategies. So a mechanism with
multiple equilibria may also have some undesirable equilibrium outcomes. The goal of
designing a mechanism is to reach a desirable equilibrium outcome, but the mechanism may
also result in an undesirable one. Thus, while the incentive compatibility requirement
is central, it may not be sufficient for a mechanism to yield only desirable outcomes.
The severity of this multiple equilibrium problem has been exemplified by Demski and
Sappington (JET, 1984), Postlewaite and Schmeidler (JET, 1986), Repullo (Econometrica,
1988), and others. The implementation problem involves designing mechanisms to
ensure that all equilibria result in desirable outcomes, which are captured by the social
choice set.

These inequalities are called Bayesian Incentive-Compatibility Constraints. 
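To make the Bayesian incentive-compatibility constraints concrete, the following Python sketch checks them by brute force for a direct mechanism with finite type sets. The function names (`is_bic`, `second_price`, `first_price`, `u`), the two-point type set, the uniform prior and the tie-breaking rule in favor of agent 0 are all illustrative assumptions, not part of the notes.

```python
from itertools import product

def is_bic(types, prior, f, u, tol=1e-12):
    """Check the Bayesian incentive-compatibility constraints of a direct mechanism.

    types : list of finite type sets, one per agent
    prior : dict mapping a type profile (tuple) to its probability
    f     : function from a reported profile (tuple) to an outcome
    u     : function u(i, outcome, own_type) giving agent i's utility
    """
    n = len(types)
    for i in range(n):
        others = [types[j] for j in range(n) if j != i]
        for theta_i in types[i]:
            def interim(report):
                # Unnormalised conditional expectation over theta_{-i} given theta_i;
                # the normalising constant is the same for every report.
                total = 0.0
                for rest in product(*others):
                    true_profile = rest[:i] + (theta_i,) + rest[i:]
                    reported = rest[:i] + (report,) + rest[i:]
                    total += prior.get(true_profile, 0.0) * u(i, f(reported), theta_i)
                return total
            truthful = interim(theta_i)
            if any(interim(r) > truthful + tol for r in types[i]):
                return False
    return True

# Hypothetical example: one object, two bidders, valuations in {1, 2}, uniform prior.
types = [[1, 2], [1, 2]]
prior = {profile: 0.25 for profile in product(*types)}

def u(i, outcome, theta_i):
    winner, payment = outcome
    return theta_i - payment if winner == i else 0.0

def second_price(report):
    winner = 0 if report[0] >= report[1] else 1   # ties go to agent 0 (assumption)
    return winner, min(report)                    # winner pays the losing report

def first_price(report):
    winner = 0 if report[0] >= report[1] else 1
    return winner, max(report)                    # winner pays his own report

print(is_bic(types, prior, second_price, u))
print(is_bic(types, prior, first_price, u))
```

In this example the second-price rule satisfies every constraint, while the rule in which the winner pays his own report does not: a high-valuation agent gains by understating his type.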

Proposition 15.10.1 (Revelation Principle) Choice rule $f(\cdot)$ is implementable in BNE
if and only if it is truthfully implementable in BNE.

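Proof. The proof is the same as before. Suppose that there exists a mechanism
$\Gamma = (M_1, \ldots, M_n, g(\cdot))$ and an equilibrium strategy profile $m^*(\cdot) = (m_1^*(\cdot), \ldots, m_n^*(\cdot))$
such that $g(m^*(\cdot)) = f(\cdot)$ and, $\forall i$, $\forall \theta_i \in \Theta_i$,

$$m_i^*(\theta_i) \in \arg\max_{m_i \in M_i} E_{\theta_{-i}}[u_i(g(m_i, m^*_{-i}(\theta_{-i})), \theta_i) \mid \theta_i].$$

One way to deviate for agent $i$ is by pretending that his type is $\hat\theta_i$ rather than $\theta_i$, i.e.,
sending message $m_i = m_i^*(\hat\theta_i)$. This gives

$$E_{\theta_{-i}}[u_i(g(m_i^*(\theta_i), m^*_{-i}(\theta_{-i})), \theta_i) \mid \theta_i] \ge E_{\theta_{-i}}[u_i(g(m_i^*(\hat\theta_i), m^*_{-i}(\theta_{-i})), \theta_i) \mid \theta_i], \quad \forall \hat\theta_i \in \Theta_i.$$

But since $g(m^*(\theta)) = f(\theta)$, $\forall \theta \in \Theta$, we must have, $\forall i$, $\forall \theta_i \in \Theta_i$,

$$E_{\theta_{-i}}[u_i(f(\theta_i, \theta_{-i}), \theta_i) \mid \theta_i] \ge E_{\theta_{-i}}[u_i(f(\hat\theta_i, \theta_{-i}), \theta_i) \mid \theta_i], \quad \forall \hat\theta_i \in \Theta_i.$$

The converse is immediate, since the direct revelation mechanism is itself a mechanism.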

15.10.2 Ex-Post Efficient Implementation by the Expected Externality Mechanism

Once again, let us return to the quasilinear setting. From the section on dominant
strategy implementation, we know that there is, in general, no ex post Pareto efficient
implementation if dominant strategy behavior is assumed. However, in
quasilinear environments, relaxation of the equilibrium concept from DS to BNE allows
us to implement an ex post Pareto efficient choice rule $f(\cdot) = (y^*(\cdot), t_1(\cdot), \ldots, t_n(\cdot))$, where
$\forall \theta \in \Theta$,

$$y^*(\theta) \in \arg\max_{y \in Y} \sum_{i=1}^{n} v_i(y, \theta_i),$$

and there is a balanced budget:

$$\sum_{i=1}^{n} t_i(\theta) = 0.$$

The Expected Externality Mechanism described below was suggested independently
by d'Aspremont and Gerard-Varet (1979) and Arrow (1979), and is also called the AGV
mechanism in the literature. This mechanism enables us to have ex post Pareto efficient
implementation under the following additional assumption.

Assumption 15.10.1 Types are distributed independently: $\varphi(\theta) = \prod_i \varphi_i(\theta_i)$, $\forall \theta \in \Theta$.

To see this, take the VCG transfer for agent $i$ and, instead of using other agents'
announced types, take the expectation over their possible types:

$$t_i(\hat\theta) = E_{\theta_{-i}}\Big[\sum_{j \neq i} v_j(y^*(\hat\theta_i, \theta_{-i}), \theta_j)\Big] + d_i(\hat\theta_{-i})$$

(by the above assumption the expectation over $\theta_{-i}$ does not have to be taken conditionally
on $\theta_i$). Note that unlike in VCG mechanisms, the first term only depends on agent $i$'s
announcement $\hat\theta_i$ and not on other agents' announcements. This is because it sums the
expected utilities of agents $j \neq i$ assuming that they tell the truth and given that $i$
announced $\hat\theta_i$, and does not depend on the actual announcements of agents $j \neq i$. This
means that $t_i(\cdot)$ is less "variable", but on average it will cause $i$'s incentives to be lined
up with the social welfare.

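To see that $i$'s incentive compatibility is satisfied given that agents $j \neq i$ announce
truthfully, observe that agent $i$ solves

$$\max_{\hat\theta_i \in \Theta_i} E_{\theta_{-i}}[v_i(y^*(\hat\theta_i, \theta_{-i}), \theta_i) + t_i(\hat\theta_i, \theta_{-i})] = \max_{\hat\theta_i \in \Theta_i} E_{\theta_{-i}}\Big[\sum_{j=1}^{n} v_j(y^*(\hat\theta_i, \theta_{-i}), \theta_j)\Big] + E_{\theta_{-i}}[d_i(\theta_{-i})].$$

Again, agent $i$'s announcement only matters through the decision $y^*(\hat\theta_i, \theta_{-i})$. Furthermore,
by the definition of the efficient decision rule $y^*(\cdot)$, for each realization of $\theta_{-i} \in \Theta_{-i}$,
the sum inside the expectation is maximized by choosing the decision $y^*(\theta_i, \theta_{-i})$, which can be
achieved by announcing truthfully: $\hat\theta_i = \theta_i$. Therefore, truthful announcement maximizes
agent $i$'s expected utility as well. Thus, BIC is satisfied.

Remark 15.10.3 The argument relies on the assumption that other agents announce
truthfully. Thus, it is in general not a dominant strategy for agent $i$ to announce the
truth. Indeed, if agent $i$ expects the other agents to lie, i.e., to announce
$\hat\theta_{-i}(\theta_{-i}) \neq \theta_{-i}$, then agent $i$'s expected utility is

$$E_{\theta_{-i}}\Big[v_i(y^*(\hat\theta_i, \hat\theta_{-i}(\theta_{-i})), \theta_i) + \sum_{j \neq i} v_j(y^*(\hat\theta_i, \theta_{-i}), \theta_j)\Big] + E_{\theta_{-i}}[d_i(\hat\theta_{-i}(\theta_{-i}))],$$

which may not be maximized by truthful announcement.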

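Furthermore, we can now choose functions $d_i(\cdot)$ so that the budget is balanced. To
see this, let

$$\xi_i(\theta_i) = E_{\theta_{-i}}\Big[\sum_{j \neq i} v_j(y^*(\theta_i, \theta_{-i}), \theta_j)\Big],$$

so that the transfers in the Expected Externality mechanism are $t_i(\theta) = \xi_i(\theta_i) + d_i(\theta_{-i})$.
We will show that we can use the $d(\cdot)$ functions to "finance" the $\xi(\cdot)$ functions in the
following way:
Let

$$d_j(\theta_{-j}) = -\frac{1}{n-1} \sum_{i \neq j} \xi_i(\theta_i).$$

Then

$$\sum_{j=1}^{n} d_j(\theta_{-j}) = -\frac{1}{n-1} \sum_{j=1}^{n} \sum_{i \neq j} \xi_i(\theta_i) = -\frac{1}{n-1} \sum_{i=1}^{n} \sum_{j \neq i} \xi_i(\theta_i)$$

$$= -\frac{1}{n-1} \sum_{i=1}^{n} (n-1) \xi_i(\theta_i) = -\sum_{i=1}^{n} \xi_i(\theta_i),$$

and therefore

$$\sum_{i=1}^{n} t_i(\theta) = \sum_{i=1}^{n} \xi_i(\theta_i) + \sum_{i=1}^{n} d_i(\theta_{-i}) = 0.$$

Thus, we have shown that when agents' Bernoulli utility functions are quasilinear
and agents' types are statistically independent, there is an ex post efficient social choice
function that is implementable in Bayesian-Nash equilibrium.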
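The construction can be checked numerically on a small example. The sketch below assumes a hypothetical public-project setting (two agents, $y \in \{0,1\}$, $v_i(y, \theta_i) = \theta_i y$, two equally likely and independent types per agent); these numbers and all helper names are illustrative and do not come from the notes. It builds the expected externality transfers, with each agent's $d_i$ set to minus the average of the other agents' expected externality terms, and verifies ex post budget balance and the BIC inequalities using exact rational arithmetic.

```python
from fractions import Fraction
from itertools import product
from math import prod

# Hypothetical example: public project y in {0, 1}, v_i(y, theta_i) = theta_i * y.
types = [[-1, 3], [-2, 2]]
half = Fraction(1, 2)
probs = [{-1: half, 3: half}, {-2: half, 2: half}]   # independent types
n = len(types)

def v(i, y, theta_i):
    return theta_i * y

def y_star(theta):
    # Ex post efficient decision: undertake the project iff total value is nonnegative.
    return 1 if sum(theta) >= 0 else 0

def profiles(i, own):
    """Yield (profile with `own` in slot i, probability of the other agents' types)."""
    idx = [j for j in range(n) if j != i]
    for rest in product(*(types[j] for j in idx)):
        profile = list(rest)
        profile.insert(i, own)
        yield tuple(profile), prod(probs[j][t] for j, t in zip(idx, rest))

def xi(i, theta_i):
    # Expected externality: expected sum of the others' valuations given i's announcement.
    return sum(p * sum(v(j, y_star(th), th[j]) for j in range(n) if j != i)
               for th, p in profiles(i, theta_i))

def transfer(i, theta):
    # d_i "finances" an equal share of the other agents' expected externality terms.
    d_i = -sum(xi(j, theta[j]) for j in range(n) if j != i) / (n - 1)
    return xi(i, theta[i]) + d_i

def interim_utility(i, theta_i, report):
    # Others report truthfully; agent i of type theta_i announces `report`.
    return sum(p * (v(i, y_star(rep), theta_i) + transfer(i, rep))
               for rep, p in profiles(i, report))

budget_balanced = all(sum(transfer(i, th) for i in range(n)) == 0
                      for th in product(*types))
bic = all(interim_utility(i, t, t) >= interim_utility(i, t, r)
          for i in range(n) for t in types[i] for r in types[i])
print(budget_balanced, bic)
```

Exact fractions are used so that budget balance can be tested with equality rather than a floating-point tolerance.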

Application to a Linear Model 

Consider the quasi-linear environment with the decision set $Y \subset \mathbb{R}^n$, and the type
spaces $\Theta_i = [\underline\theta_i, \bar\theta_i] \subset \mathbb{R}$ for all $i$. Each agent $i$'s utility takes the form

$$\theta_i y_i + t_i.$$

Thus, agent $i$ cares only about component $y_i$ of the public outcome. Note that these
payoffs satisfy SCP.

The decision set $Y$ depends on the application.

Two examples:

1. Allocating a private good: $Y = \{y \in \{0,1\}^n : \sum_i y_i = 1\}$.

2. Provision of a nonexcludable public good: $Y = \{(q, \ldots, q) \in \mathbb{R}^n : q \in \{0,1\}\}$.

We can fully characterize both Dominant Incentive Compatible (DIC) and Bayesian Incentive
Compatible (BIC) choice rules in this environment using the one-agent characterization in
Lemma 15.7.1.

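Let $U_i(\theta) = \theta_i y_i(\theta) + t_i(\theta)$. Applying Lemma 15.7.1 to characterize DIC constraints
for any fixed $\theta_{-i}$, we have

Proposition 15.10.2 In the linear model, choice rule $(y(\cdot), t_1(\cdot), \ldots, t_n(\cdot))$ is Dominant
Incentive Compatible if and only if for all $i \in I$,

(1) $y_i(\theta_i, \theta_{-i})$ is nondecreasing in $\theta_i$ (DM);

(2) $U_i(\theta_i, \theta_{-i}) = U_i(\underline\theta_i, \theta_{-i}) + \int_{\underline\theta_i}^{\theta_i} y_i(\tau, \theta_{-i}) \, d\tau$, $\forall \theta \in \Theta$. (DICFOC)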

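As for agent $i$'s interim expected utility when he announces type $\hat\theta_i$, it is

$$\theta_i E_{\theta_{-i}} y_i(\hat\theta_i, \theta_{-i}) + E_{\theta_{-i}} t_i(\hat\theta_i, \theta_{-i});$$

thus, it depends only on the interim expected consumption $E_{\theta_{-i}} y_i(\hat\theta_i, \theta_{-i})$ and transfer
$E_{\theta_{-i}} t_i(\hat\theta_i, \theta_{-i})$. BIC means that truth-telling is optimal at the interim stage. Therefore,
BIC choice rules can be fully characterized using the one-agent result in Lemma 15.7.1.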

Proposition 15.10.3 In the linear model, choice rule $(y(\cdot), t_1(\cdot), \ldots, t_n(\cdot))$ is Bayesian
Incentive Compatible if and only if for all $i \in I$,

(1) $E_{\theta_{-i}} y_i(\theta_i, \theta_{-i})$ is nondecreasing in $\theta_i$ (BM);

(2) $E_{\theta_{-i}} U_i(\theta_i, \theta_{-i}) = E_{\theta_{-i}} U_i(\underline\theta_i, \theta_{-i}) + \int_{\underline\theta_i}^{\theta_i} E_{\theta_{-i}} y_i(\tau, \theta_{-i}) \, d\tau$, $\forall \theta_i \in \Theta_i$. (BICFOC)

Note that (BICFOC) is a pooling of (DICFOC), and similarly (BM) is a pooling of
(DM). The latter means that BIC allows us to implement some decision rules that are not
DS implementable.

The proposition implies that in any two BIC mechanisms implementing the same
decision rule $y(\cdot)$, the interim expected utilities $E_{\theta_{-i}} U_i(\theta)$ and thus the transfers $E_{\theta_{-i}} t_i(\theta)$
coincide up to a constant.

Corollary 15.10.1 In the linear model, for any BIC mechanism implementing the ex 
post Pareto efficient decision rule y(-), there exists a VCG mechanism with the same 
interim expected transfers and utilities. 

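Proof. If $(y^*(\cdot), t(\cdot))$ is a BIC mechanism and $(y^*(\cdot), \tilde t(\cdot))$ is a VCG mechanism, then
by Proposition 15.10.3, $E_{\theta_{-i}} t_i(\theta) = E_{\theta_{-i}} \tilde t_i(\theta) + c_i$, $\forall \theta_i \in \Theta_i$. Then letting
$\bar t_i(\theta) = \tilde t_i(\theta) + c_i$, $(y^*(\cdot), \bar t(\cdot))$ is also a VCG mechanism, and
$E_{\theta_{-i}} \bar t_i(\theta) = E_{\theta_{-i}} t_i(\theta)$.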

For example, the expected externality mechanism is interim-equivalent to a VCG
mechanism. Its only advantage is that it allows us to balance the budget ex post, i.e., in
each state of the world. More generally, if a decision rule $y(\cdot)$ is DS implementable, the
only reason to implement it in a BIC mechanism that is not DIC is that we care about
ex post transfers/utilities rather than just their interim or ex ante expectations.

15.10.3 Participation Constraints 

Thinking of the mechanism as a contract raises the following issues: 

• Will the agents accept it voluntarily, i.e., are their participation constraints satisfied? 

• If one of the agents designs the contract, he will try to maximize his own payoff
subject to the other agents' participation constraints. What will the optimal contract
look like?

To analyze these questions, we need to impose additional restrictions on the choice
rule in the form of participation constraints. These constraints depend on when agents
can withdraw from the mechanism, and what they get when they do. Let $\hat u_i(\theta_i)$ be the
utility of agent $i$ if he withdraws from the mechanism. (This assumes that when an agent
withdraws from the mechanism he does not care what the mechanism does with other
agents.)

Definition 15.10.2 The choice rule $f(\cdot)$ is

1. Ex Post Individually Rational if for all $i$,

$$U_i(\theta) \equiv u_i(f(\theta), \theta) \ge \hat u_i(\theta_i), \quad \forall \theta \in \Theta.$$

2. Interim Individually Rational if for all $i$,

$$E_{\theta_{-i}}[U_i(\theta_i, \theta_{-i})] \ge \hat u_i(\theta_i), \quad \forall \theta_i \in \Theta_i.$$

3. Ex ante Individually Rational if for all $i$,

$$E_\theta[U_i(\theta)] \ge E_{\theta_i}[\hat u_i(\theta_i)].$$

Note that ex post IR implies interim IR, which in turn implies ex ante IR, but the
reverse may not be true. Thus, the constraints imposed by voluntary participation are
most severe when agents can withdraw at the ex post stage.

Thus, when agents' types are not observed, the set of social choice functions that can
be successfully implemented are those that satisfy not only incentive compatibility
(in dominant strategies or in Bayesian-Nash equilibrium, depending
on the equilibrium concept used), but also the participation constraints that are relevant in
the environment under study.

The ex post IRs arise when the agent can withdraw in any state after the announcement 
of the outcome. For example, they are satisfied by any decentralized bargaining procedure. 
These are the hardest constraints to satisfy. 

The Interim IRs arise when the agent can withdraw after learning his type $\theta_i$, but
before learning anything about other agents' types. Once the agent decides to participate, 
the outcome can be imposed on him. These constraints are easier to satisfy than ex post 
IR. 

Finally, with the ex ante participation constraint the agent can commit to participating
even before his type is realized. These are the easiest constraints to satisfy. For example,
in a quasi-linear environment, whenever the mechanism generates a positive expected
surplus, i.e.,

$$E_\theta\Big[\sum_i U_i(\theta)\Big] \ge E_\theta\Big[\sum_i \hat u_i(\theta_i)\Big],$$

all agents' ex ante IR can be satisfied by reallocating expected surplus among agents
through lump-sum transfers, which will not disturb agents' incentive constraints or budget
balance.

For this reason, we will focus mainly on interim IR. In the following, we illustrate
further the limitations that participation constraints may impose on the set of
implementable social choice functions, using the important theorem of Myerson and
Satterthwaite (1983).

The Myerson-Satterthwaite Theorem 

Even though Pareto efficient mechanisms do exist (e.g., the expected externality
mechanism), it remains unclear whether such a mechanism may result from private contracting
among the parties. We have already seen that private contracting need not yield efficiency.
In the Principal-Agent model, the Principal offers an inefficient contract to extract the
agent's information rent. However, this leaves open the question of whether there is some
contracting/bargaining procedure that would yield efficiency. For example, in the P-A
model, if the agent makes an offer to the principal, who has no private information of his
own, the agent would extract all the surplus and implement the efficient outcome as a result.
Therefore, we focus on a bilateral situation in which both parties have private information.
In this situation, it turns out that generally there does not exist an efficient mechanism that
satisfies both agents' participation constraints.

Consider the setting of allocating an indivisible object with two agents, a seller and
a buyer: $I = \{S, B\}$. Each agent's type is $\theta_i \in \Theta_i = [\underline\theta_i, \bar\theta_i] \subset \mathbb{R}$, where the $\theta_i \sim \varphi_i(\cdot)$ are
independent, and $\varphi_i(\theta_i) > 0$ for all $\theta_i \in \Theta_i$. Let $y \in \{0, 1\}$ indicate whether B receives the
good. A choice rule is then $f(\theta) = (y(\theta), t_B(\theta), t_S(\theta))$. The agents' utilities can then be
written as

$$u_B(y, \theta_B) = \theta_B y + t_B,$$
$$u_S(y, \theta_S) = -\theta_S y + t_S.$$

It is easy to see that an efficient decision rule $y^*(\theta)$ must have (see footnote 3)

$$y^*(\theta_B, \theta_S) = \begin{cases} 1 & \text{if } \theta_B > \theta_S, \\ 0 & \text{if } \theta_B < \theta_S. \end{cases}$$

We could use an expected externality mechanism to implement an efficient decision
rule in BNE with ex post budget balance. However, suppose that we have to satisfy interim
IR:

$$E_{\theta_S}[\theta_B y(\theta_B, \theta_S) + t_B(\theta_B, \theta_S)] \ge 0, \quad \forall \theta_B \in \Theta_B,$$
$$E_{\theta_B}[-\theta_S y(\theta_B, \theta_S) + t_S(\theta_B, \theta_S)] \ge 0, \quad \forall \theta_S \in \Theta_S.$$

As for budget balance, let us relax this requirement by requiring only ex ante Budget
Balance:

$$E_\theta[t_B(\theta_B, \theta_S) + t_S(\theta_B, \theta_S)] \le 0.$$

Unlike ex post budget balance considered before, ex ante budget balance allows us to
borrow money as long as we break even on average. It also allows us to have a surplus of
funds. The only constraint is that we cannot have an expected shortage of funds.

We can then formulate a negative result: 

Theorem 15.10.1 (Myerson-Satterthwaite) In the two-party trade setting above with
$(\underline\theta_B, \bar\theta_B) \cap (\underline\theta_S, \bar\theta_S) \neq \emptyset$ (gains from trade are possible but not certain) there is no BIC choice
rule that has the efficient decision rule and satisfies ex ante Budget Balance and interim
IR.

Proof. Consider first the case where $[\underline\theta_B, \bar\theta_B] = [\underline\theta_S, \bar\theta_S] = [\underline\theta, \bar\theta]$.

By Corollary 15.10.1 above we can restrict attention to VCG mechanisms while
preserving ex ante budget balance and interim IR. Such mechanisms take the form

$$t_B(\theta_B, \theta_S) = -\theta_S y^*(\theta_B, \theta_S) + d_B(\theta_S),$$
$$t_S(\theta_B, \theta_S) = \theta_B y^*(\theta_B, \theta_S) + d_S(\theta_B).$$

By interim IR of B's type $\underline\theta$, using the fact that $y^*(\underline\theta, \theta_S) = 0$ with probability 1,
we must have $E_{\theta_S} d_B(\theta_S) \ge 0$. Similarly, by interim IR of S's type $\bar\theta$, using the fact
that $y^*(\theta_B, \bar\theta) = 0$ with probability 1, we must have $E_{\theta_B} d_S(\theta_B) \ge 0$. Thus, adding the
transfers, we have

$$E_\theta[t_B(\theta_B, \theta_S) + t_S(\theta_B, \theta_S)] = E_\theta[(\theta_B - \theta_S) y^*(\theta_B, \theta_S)] + E_{\theta_S}[d_B(\theta_S)] + E_{\theta_B}[d_S(\theta_B)]$$

$$\ge E_\theta[\max\{\theta_B - \theta_S, 0\}] > 0,$$

since $\Pr(\theta_B > \theta_S) > 0$. Therefore, ex ante budget balance cannot be satisfied.

Now, in the general case, let $(\underline\theta, \bar\theta) = (\underline\theta_B, \bar\theta_B) \cap (\underline\theta_S, \bar\theta_S)$, and observe that any type of
either agent above $\bar\theta$ [or below $\underline\theta$] has the same decision rule, and therefore (by IC) must
have the same transfer, as this agent's type $\bar\theta$ [resp. $\underline\theta$]. Therefore, the payments are the
same as if both agents had valuations distributed on $[\underline\theta, \bar\theta]$, with possible atoms on $\underline\theta$
and $\bar\theta$. The argument thus still applies.

3 Given our full support assumption on the distributions, ex ante efficiency dictates that the decision
rule coincide with $y^*(\cdot)$ almost everywhere.

The intuition for the proof is simple: In a VCG mechanism, in order to induce truthful
revelation, each agent must become the residual claimant for the total surplus. This means
that in case trade is implemented, the buyer pays the seller's cost for the object, and
the seller receives the buyer's valuation for the object. Any additional payments to the
agents must be nonnegative, in order to satisfy interim IRs of the lowest-valuation buyer
and the highest-cost seller. Thus, each agent's utility must be at least equal to the
total surplus. In BNE implementation, agents receive the same expected utilities as in
the VCG mechanism, thus again each agent's expected utility must be at least equal to the
total expected surplus. This cannot be done without having an expected infusion of funds
equal to the expected surplus.
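As a numerical illustration of the first case of the proof, suppose (an assumption made only for this sketch) that both types are independent and uniform on $[0, 1]$, and take the VCG transfers with $d_B = d_S = 0$, the smallest values allowed by interim IR of the worst types. The expected sum of transfers is then $E[\max\{\theta_B - \theta_S, 0\}] = 1/6 > 0$, so the mechanism runs an expected deficit. The function name and grid size are hypothetical.

```python
def expected_deficit(grid=400):
    """Midpoint-rule estimate of E[t_B + t_S] for the VCG transfers with d_B = d_S = 0,
    when theta_B and theta_S are independent and uniform on [0, 1] (an assumption)."""
    h = 1.0 / grid
    total = 0.0
    for a in range(grid):
        theta_b = (a + 0.5) * h
        for b in range(grid):
            theta_s = (b + 0.5) * h
            y = 1 if theta_b > theta_s else 0   # efficient trade rule y*
            t_b = -theta_s * y                  # buyer pays the seller's cost
            t_s = theta_b * y                   # seller receives the buyer's valuation
            total += (t_b + t_s) * h * h
    return total

print(expected_deficit())
```

The estimate approaches 1/6 as the grid is refined, which is exactly the expected gains from trade that an outside party would have to inject.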

Interpretation of the Theorem: Who is designing the mechanism to maximize 
expected surplus subject to agents' interim IRs? 

• Agents themselves at the ex ante stage? They would face ex ante rather than interim 
IRs, which would be easy to satisfy. 

• An agent at the interim stage? He would be interested not in efficiency but in 
maximizing his own payoff. 

• A benevolent mediator at the interim stage? But where would such a mediator
come from?

A better interpretation of the result is as an upper bound on the efficiency of decentralized
bargaining procedures. Indeed, any such procedure can be thought of as a mechanism,
and must satisfy interim IR (indeed, even ex post IR), and ex ante budget balance (indeed,
even ex post budget balance). The Theorem says that decentralized bargaining in this
case cannot be efficient. In the terminology of the Coase Theorem, private information
creates a "transaction cost".

Cramton-Gibbons-Klemperer (1987) show that when the object is divisible, and is
jointly owned initially, efficiency can be attained if initial ownership is sufficiently evenly
allocated. For example, suppose that the buyer initially owns a share $\hat y \in [0, 1]$ of the object.
Efficiency can be achieved when $\hat y$ is sufficiently close to 1/2. Thus, when a partnership is
formed, the partners can choose initial shares so as to eliminate inefficiencies in dissolution.
While this result can be applied to study optimal initial allocations of property rights, it
does not explain why property rights are good at all. That is, interim IRs stemming from
property rights can only hurt the parties in the model. For example, the parties could
achieve full efficiency by writing an ex ante contract specifying the AGV mechanism and
not allowing withdrawal at the interim stage. One would have to appeal to a difficulty of
specifying complicated mechanisms such as AGV to explain why the parties would look
for optimal property rights that facilitate efficient renegotiation rather than specifying an
efficient mechanism ex ante without allowing subsequent withdrawal or renegotiation.

15.10.4 The Revenue Equivalence Theorem in Auctions 

Let us consider again the setting of allocating an indivisible object among $n$ risk-neutral
buyers: $Y = \{y \in \{0,1\}^n : \sum_i y_i = 1\}$, and the payoff of buyer $i$ is

$$\theta_i y_i + t_i.$$

The buyers' valuations are independently drawn from $[\underline\theta_i, \bar\theta_i]$ with $\underline\theta_i < \bar\theta_i$ according to a
strictly positive density $\varphi_i(\cdot)$, with c.d.f. denoted by $\Phi_i(\cdot)$.

Suppose that the object initially belongs to a seller (auctioneer), who is the residual
sink of payments. The auctioneer's expected revenue in a choice rule $(y(\cdot), t_1, \ldots, t_n)$
can be written as

$$-\sum_i E_\theta t_i(\theta) = \sum_i E_\theta[\theta_i y_i(\theta) - U_i(\theta)] = \sum_i E_\theta[\theta_i y_i(\theta)] - \sum_i E_{\theta_i} E_{\theta_{-i}}[U_i(\theta_i, \theta_{-i})].$$

The first term is the agents' expected total surplus, while the second term is the
sum of the agents' expected utilities. By Proposition 15.10.3, the second term is fully
determined by the decision (object allocation) rule $y(\cdot)$ together with the lowest types'
interim expected utilities $E_{\theta_{-i}}[U_i(\underline\theta_i, \theta_{-i})]$. Since the total surplus is pinned down as well,
we have the following result.

Theorem 15.10.2 (The Revenue Equivalence Theorem) Suppose that two different
auction mechanisms have Bayesian-Nash equilibria in which (i) the same decision
(object allocation) rule $y(\cdot)$ is implemented, and (ii) each buyer $i$ has the same interim
expected utility when his valuation is $\underline\theta_i$. Then the two equilibria of the two auction
mechanisms generate the same expected revenue for the seller.

Note that even when the decision rule and the lowest types' utilities are fixed, the seller
still has significant freedom in designing the auction procedure, since there are many ways
to achieve given interim expected transfers $\bar t_i(\theta_i)$ with different ex post transfers $t_i(\theta)$.

For example, suppose that the buyers are symmetric, and that the seller wants to
implement an efficient decision rule $y^*(\cdot)$, and make sure that the lowest-valuation buyers
receive zero expected utility. We already know that this can be done in dominant strategies,
using the Vickrey (second-price sealed-bid) auction. More generally, consider a $k$-th price
sealed-bid auction, with $1 \le k \le n$. Here the winner is the highest bidder, but he pays
the $k$-th highest bid. Suppose that buyers' valuations are i.i.d. Then it can be shown
that the auction has a unique equilibrium, which is symmetric, and in which an agent's
bid is an increasing function of his valuation, $b(\theta_i)$. (See, e.g., Fudenberg-Tirole, "Game
Theory," pp. 223-225.) Agent $i$ receives the object when he submits the maximum bid,
which happens when he has the maximum valuation. Therefore, the auction implements
an efficient decision rule. Also, a buyer with valuation $\underline\theta$ wins with zero probability, hence
he has zero expected utility. Hence, the Revenue Equivalence Theorem establishes that
$k$-th price sealed-bid auctions generate the same expected revenue for the seller for all $k$.

How can this be true, if for any given bid profile, the seller receives a higher revenue
when $k$ is lower? The answer is that the bidders will submit lower bids when $k$ is lower,
which exactly offsets the first effect. For example, we know that in the second-price
auction, buyers will bid their true valuations. In the first-price auction, on the other hand,
buyers will obviously bid less than their true valuations, since bidding their true valuations
would give them zero expected utility. The revenue equivalence theorem establishes that
the expected revenue ends up being the same. Remarkably, we know this without solving
for the actual equilibrium of the auction. With $k > 2$, the revenue equivalence theorem
implies that buyers will bid more than their true valuations.
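The offsetting effect can be seen in a small Monte Carlo sketch. It assumes valuations that are i.i.d. uniform on $[0, 1]$, truthful bidding in the second-price auction, and the symmetric first-price equilibrium bid $b(\theta) = \frac{n-1}{n}\theta$, a standard result for the uniform case that is not derived in these notes. Under these assumptions both formats should yield expected revenue $(n-1)/(n+1)$, up to simulation error. The function name, seed and sample size are illustrative.

```python
import random

def simulate_revenue(n=3, draws=100_000, seed=0):
    """Average seller revenue in first- and second-price auctions with
    n bidders whose valuations are i.i.d. uniform on [0, 1] (assumption)."""
    rng = random.Random(seed)
    first = second = 0.0
    for _ in range(draws):
        vals = sorted(rng.random() for _ in range(n))
        # Second-price: bids are truthful, the winner pays the second-highest valuation.
        second += vals[-2]
        # First-price: the winner pays his own shaded bid b(theta) = (n - 1) / n * theta.
        first += (n - 1) / n * vals[-1]
    return first / draws, second / draws

first_price_rev, second_price_rev = simulate_revenue()
print(first_price_rev, second_price_rev, (3 - 1) / (3 + 1))
```

For a given bid profile the first-price payment is higher, but the bids themselves are shaded by the factor $(n-1)/n$, and the two averages agree up to sampling noise.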

Now we can also solve for the seller's optimal auction. Suppose the seller also has
the option of keeping the object to herself. The seller will be called "agent zero", her
valuation for the object is denoted by $\theta_0$, and whether she keeps the object is denoted
by $y_0 \in \{0, 1\}$. Thus, we must have $\sum_{i=0}^{n} y_i = 1$.

The seller's expected payoff can be written as

$$\theta_0 E_\theta y_0(\theta) + \text{Expected Revenue} = \theta_0 E_\theta y_0(\theta) + \sum_{i=1}^{n} E_\theta[\theta_i y_i(\theta)] - \sum_{i=1}^{n} E_{\theta_i} E_{\theta_{-i}}[U_i(\theta_i, \theta_{-i})].$$

Thus, the seller's expected payoff is the difference between total surplus and the agents'
expected information rents.

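By Proposition 15.10.3, the buyers' interim expected utilities must satisfy

$$E_{\theta_{-i}}[U_i(\theta_i, \theta_{-i})] = E_{\theta_{-i}}[U_i(\underline\theta_i, \theta_{-i})] + \int_{\underline\theta_i}^{\theta_i} E_{\theta_{-i}} y_i(\tau, \theta_{-i}) \, d\tau, \quad \forall \theta_i \in \Theta_i. \qquad \text{(BICFOC)}$$

By the buyers' interim IR, $E_{\theta_{-i}}[U_i(\underline\theta_i, \theta_{-i})] \ge 0$, $\forall i$, and for a given decision rule $y$, the
seller will optimally choose transfers to set $E_{\theta_{-i}}[U_i(\underline\theta_i, \theta_{-i})] = 0$, $\forall i$ (due to SCP, the other
types' IR will then be satisfied as well; see Lemma 15.7.1 above). Furthermore, just as in
the one-agent case, upon integration by parts, buyer $i$'s expected information rent can
be written as

$$E_{\theta_i} E_{\theta_{-i}}[U_i(\theta_i, \theta_{-i})] = E_\theta\Big[\frac{y_i(\theta)}{h_i(\theta_i)}\Big],$$

where $h_i(\theta_i) = \frac{\varphi_i(\theta_i)}{1 - \Phi_i(\theta_i)}$ is the hazard rate of agent $i$. Substituting in the seller's payoff,
we can write it as the expected "virtual surplus":

$$E_\theta\Big[\theta_0 y_0(\theta) + \sum_{i=1}^{n} \Big(\theta_i - \frac{1}{h_i(\theta_i)}\Big) y_i(\theta)\Big].$$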

Finally, for $i \ge 1$, let $\nu_i(\theta_i) = \theta_i - 1/h_i(\theta_i)$, which we will call the "virtual valuation"
of agent $i$. Let also $\nu_0(\theta_0) = \theta_0$. Then the seller's program can be written as

$$\max_{y(\cdot)} E_\theta\Big[\sum_{i=0}^{n} \nu_i(\theta_i) y_i(\theta)\Big] \quad \text{s.t.}$$

$$E_{\theta_{-i}} y_i(\theta_i, \theta_{-i}) \text{ is nondecreasing in } \theta_i, \ \forall i \ge 1. \qquad \text{(BM)}$$

If we ignore Bayesian Monotonicity (BM) above, we can maximize the above expectation
for each state $\theta$ independently. The solution then is to give the object to the agent
who has the highest virtual valuation in the particular state: we have $y_i(\theta) = 1$ when
$\nu_i(\theta_i) > \nu_j(\theta_j)$ for all agents $j \neq i$. The decision rule achieving this can be written as
$y(\theta) = y^*(\nu_0(\theta_0), \nu_1(\theta_1), \ldots, \nu_n(\theta_n))$, where $y^*(\cdot)$ is the first-best efficient decision rule.
Intuitively, the principal uses agents' virtual valuations rather than their true valuations
because she cannot extract all of the agents' rents. An agent's virtual valuation is lower
than his true valuation because it accounts for the agent's information rents, which cannot
be extracted by the principal. The profit-maximizing mechanism allocates the object to
the agent with the highest virtual valuation.

Under what conditions can we ignore the monotonicity constraint? Note that when
the virtual valuation function $\nu_i(\theta_i) = \theta_i - 1/h_i(\theta_i)$ is an increasing function (see footnote 4), an
increase in an agent's valuation makes him more likely to receive the object in the solution
to the relaxed problem. Therefore, $y_i(\theta_i, \theta_{-i})$ is nondecreasing in $\theta_i$ for all $\theta_{-i}$, and so by
Proposition 15.10.2 the optimal allocation rule is implementable not just in BNE, but
also in DS. The DIC transfers could be constructed by integration using DICFOC; the
resulting transfers take a simple form:

$$t_i(\theta) = -p_i(\theta_{-i}) y_i(\theta), \quad \text{where } p_i(\theta_{-i}) = \inf\{\hat\theta_i \in [\underline\theta_i, \bar\theta_i] : y_i(\hat\theta_i, \theta_{-i}) = 1\}.$$

Thus, for each agent $i$ there is a "pricing rule" $p_i(\theta_{-i})$ that is a function of others'
announcements, so that the agent wins if his announced valuation exceeds the price, and
he pays the price whenever he wins. This makes truth-telling a dominant strategy by the
same logic as in the Vickrey auction: lying cannot affect your own price but only whether
you win or lose, and you want to win exactly when your valuation is above the price you
face.
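The pricing rule can be computed directly from its definition. The sketch below assumes symmetric bidders with types uniform on $[0, 1]$, so that the virtual valuation $\nu(\theta) = 2\theta - 1$ is increasing, and finds $p_i(\theta_{-i})$ by bisection on the relaxed-problem allocation. The names `nu`, `wins` and `price` and the tolerance are hypothetical choices for illustration.

```python
def nu(theta):
    # Virtual valuation of a uniform[0, 1] type (assumption): theta - (1 - theta) / 1.
    return 2.0 * theta - 1.0

def wins(theta_i, others, theta0):
    """Relaxed-problem allocation: agent i gets the object iff his virtual valuation
    strictly exceeds the seller's value and every rival's virtual valuation."""
    return nu(theta_i) > max([theta0] + [nu(t) for t in others])

def price(others, theta0, lo=0.0, hi=1.0, tol=1e-10):
    """p_i(theta_{-i}) = inf{theta_i : y_i(theta_i, theta_{-i}) = 1}, found by bisection.
    Valid because wins() is monotone in theta_i when nu is increasing.
    Returns hi if agent i cannot win with any type in [lo, hi]."""
    if not wins(hi, others, theta0):
        return hi
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if wins(mid, others, theta0):
            hi = mid
        else:
            lo = mid
    return hi

print(price([0.3, 0.2], theta0=0.0))  # rivals are weak: the seller's value sets the price
print(price([0.8, 0.2], theta0=0.0))  # the strongest rival's type sets the price
```

Because the price depends only on the other agents' announcements, a winner's own report changes nothing except whether he wins, which is the dominant-strategy logic described above.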

On the other hand, if virtual valuation functions are not increasing, (BM) may bind, 
and an "ironing" procedure should be used to identify the regions on which it binds. 

4 A sufficient condition for this is that the hazard rate $h_i(\theta_i)$ is a nondecreasing function. This is the
same argument as in the one-agent case (note that $v_{y\theta\theta} = 0$ for the linear utility function $v(y, \theta) = \theta y$).

Since the principal discounts agents' true valuations, she is less likely to sell the object
than would be socially optimal. When she does sell the object, will it go to the agent
with the highest valuation? This depends:

• The Symmetric Case: Agents' types are identically distributed. Then they all have
the same virtual valuation function $\nu(\cdot)$. Suppose it is increasing. Then whenever
the principal sells the good, it must go to the agent with the highest valuation $\theta_i$,
because he also has the highest virtual valuation $\nu(\theta_i)$. The principal will sell the
good when $\nu(\max_{i \ge 1} \theta_i) > \theta_0$.

The DS implementation of the optimal allocation rule is the Vickrey auction in
which the seller submits a bid equal to $p = \nu^{-1}(\theta_0)$, called the "reserve price". It
exceeds the principal's value $\theta_0$ for the object, thus the object is less likely to be
sold than in the social optimum. The intuition is that the principal reduces agents'
consumption to reduce their information rents, just as in the one-agent case.

• The Asymmetric Case: The agent with the highest virtual valuation is now not
necessarily the one with the highest true valuation. For example, suppose that we
have two bidders, and $h_1(\theta) < h_2(\theta)$ for all $\theta$. What this inequality means intuitively
is that in an ascending price auction, where an agent drops out when the price
reaches his true valuation, bidder 1 will be less likely to drop out at any moment
conditional on both bidders staying in the game. Thus, the inequality means that
bidder 1 is in a certain stochastic sense "more eager" to buy than bidder 2 (see footnote 5). Then
$\nu_1(\theta) < \nu_2(\theta)$ for all $\theta$, i.e., the optimal auction should be biased in favor of the
"less eager" agent 2. One intuition for this is that the bias increases "competition"
among the bidders.
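
As a minimal sketch of the symmetric case, assume (hypothetically, this is not in the text) that valuations are uniform on [0, 1], so that F(θ) = θ, f(θ) = 1 and the virtual valuation is ν(θ) = 2θ − 1. The function names below are illustrative. The reserve price ν^{-1}(θ_0) is found by bisection, and the Vickrey auction with that reserve charges the winner the larger of the reserve and the second-highest bid.

```python
def virtual_valuation(theta):
    """nu(theta) = theta - (1 - F(theta)) / f(theta), uniform[0, 1] assumed."""
    F = theta   # uniform cdf on [0, 1] (an assumption of this sketch)
    f = 1.0     # uniform density
    return theta - (1.0 - F) / f


def reserve_price(theta0, tol=1e-12):
    """Invert nu by bisection: find p in [0, 1] with nu(p) = theta0."""
    lo, hi = 0.0, 1.0
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if virtual_valuation(mid) < theta0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


def optimal_auction(bids, theta0):
    """Vickrey auction with a reserve: return (winner index or None, price)."""
    p = reserve_price(theta0)
    ranked = sorted(range(len(bids)), key=lambda i: bids[i], reverse=True)
    top = ranked[0]
    if bids[top] < p:
        return None, 0.0                   # the object stays with the seller
    second = bids[ranked[1]] if len(bids) > 1 else 0.0
    return top, max(p, second)             # winner pays max(reserve, second bid)
```

With a seller value of 0, the reserve in this uniform example is 1/2, which exceeds the seller's own value, in line with the under-selling described above.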

15.10.5 Correlated Types 

Using the ideas of Cremer and McLean (1988), we can see that with correlated types,
"almost anything is implementable". For simplicity consider the quasi-linear environment
with two agents and $\Theta_i = \{L, H\}$ for each $i$ (Cremer-McLean have arbitrary finite
numbers of agents and types). Let the joint probability matrix of $(\theta_1, \theta_2)$ be as follows:

$$\begin{pmatrix} \pi(L,L) & \pi(L,H) \\ \pi(H,L) & \pi(H,H) \end{pmatrix}$$

5 In fact, this inequality holds whenever the distribution of bidder 1's valuation dominates that of bidder 2's
valuation in the Monotone Likelihood Ratio ordering.

(Note that the entries must add up to 1.) Correlation means that the determinant of the
matrix is nonzero. Suppose for definiteness that it is positive:

$$\pi(L,L)\,\pi(H,H) - \pi(L,H)\,\pi(H,L) > 0,$$

which corresponds to positive correlation (otherwise we could relabel L and H for one of
the agents).

Also denote the marginal and conditional probabilities as follows:

$$\pi_i(\theta_i) = \pi(\theta_i, L) + \pi(\theta_i, H),$$
$$\pi_{-i}(\theta_{-i} \mid \theta_i) = \pi(\theta_i, \theta_{-i}) / \pi_i(\theta_i).$$

Assume that the marginal probabilities $\pi_i(\theta_i) > 0$ (otherwise, agent $i$ has only one possible
type).

Also, let $\theta'$ denote the type other than $\theta$.

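Proposition 15.10.4 For an arbitrary choice rule $(y(\cdot), t_1(\cdot), t_2(\cdot))$, there exists a BIC
choice rule $(y(\cdot), \tilde t_1(\cdot), \tilde t_2(\cdot))$ with the same decision rule $y(\cdot)$ and the same interim expected
transfers:

$$E_{\theta_{-i}}[\tilde t_i(\theta_i,\theta_{-i}) \mid \theta_i] = E_{\theta_{-i}}[t_i(\theta_i,\theta_{-i}) \mid \theta_i], \quad i = 1,2, \ \theta_i = L,H.$$

Proof. Construct the new transfers as follows:

$$\tilde t_i(\theta_i,\theta_{-i}) = \begin{cases} t_i(\theta_i,\theta_{-i}) + \Delta \cdot \pi_{-i}(\theta_i' \mid \theta_i) & \text{when } \theta_{-i} = \theta_i, \\ t_i(\theta_i,\theta_{-i}) - \Delta \cdot \pi_{-i}(\theta_i \mid \theta_i) & \text{when } \theta_{-i} \ne \theta_i. \end{cases}$$

Then we have

$$E_{\theta_{-i}}[\tilde t_i(\theta_i,\theta_{-i}) \mid \theta_i] - E_{\theta_{-i}}[t_i(\theta_i,\theta_{-i}) \mid \theta_i] = \pi_{-i}(\theta_i \mid \theta_i)\,[\pi_{-i}(\theta_i' \mid \theta_i)\Delta] + \pi_{-i}(\theta_i' \mid \theta_i)\,[-\Delta\,\pi_{-i}(\theta_i \mid \theta_i)] = 0.$$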

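At the same time, the new mechanism is BIC when $\Delta > 0$ is large enough. Indeed, the
equilibrium interim expected utilities are not affected, while the interim expected utility
from lying and announcing $\theta_i' \ne \theta_i$ is

$$\tilde\Phi_i(\theta' \mid \theta) = \Phi_i(\theta' \mid \theta) + \pi_{-i}(\theta' \mid \theta)\,[\pi_{-i}(\theta \mid \theta')\Delta] + \pi_{-i}(\theta \mid \theta)\,[-\pi_{-i}(\theta' \mid \theta')\Delta]$$
$$= \Phi_i(\theta' \mid \theta) + \frac{\pi(\theta',\theta)\,\pi(\theta,\theta') - \pi(\theta,\theta)\,\pi(\theta',\theta')}{\pi_i(\theta)\,\pi_i(\theta')}\,\Delta,$$

where $\Phi_i(\theta' \mid \theta)$ is the corresponding interim expected utility under the original transfers.

By the correlation assumption, the fraction is negative, hence $\tilde\Phi_i(\theta' \mid \theta)$ can be made as
small as desired by choosing a large enough $\Delta > 0$.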

Intuitively, the proof gives agent i a lottery contingent on the other agent's type, whose
expected value is zero if agent i is truthful but negative if he lies. If the stakes in this
lottery are sufficiently high, BIC will be satisfied.
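
The lottery in the proof can be checked numerically. The sketch below uses a hypothetical positively correlated joint distribution (the numbers and function names are illustrative assumptions, not from the text) and takes the perspective of agent 1: announcing a type earns +Δ·π_{-i}(θ' | θ) when the other agent's type matches the announcement and −Δ·π_{-i}(θ | θ) otherwise.

```python
# Hypothetical joint distribution of (theta_1, theta_2) with positive correlation.
PI = {("L", "L"): 0.4, ("L", "H"): 0.1, ("H", "L"): 0.1, ("H", "H"): 0.4}
TYPES = ("L", "H")


def marginal(theta_i):
    """Marginal probability of agent 1's type."""
    return sum(PI[(theta_i, other)] for other in TYPES)


def conditional(theta_other, theta_i):
    """Pr(other agent's type | agent 1's type)."""
    return PI[(theta_i, theta_other)] / marginal(theta_i)


def other_type(theta):
    return "H" if theta == "L" else "L"


def lottery(announced, theta_other, delta):
    """Side payment added to agent 1's transfer, given his announcement."""
    if theta_other == announced:
        return delta * conditional(other_type(announced), announced)
    return -delta * conditional(announced, announced)


def expected_lottery(true_type, announced, delta):
    """Expected side payment under agent 1's true conditional beliefs."""
    return sum(conditional(b, true_type) * lottery(announced, b, delta)
               for b in TYPES)
```

For these numbers the expected side payment is zero under truth-telling and equals −0.6·Δ after a lie, so raising Δ makes lying arbitrarily unattractive.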

For example, suppose the principal wants to maximize profit. The first-best could be
achieved by having the first-best decision rule $y^*(\cdot)$ while eliminating the agents'
information rents at the same time. Indeed, first choose transfers to be

$$t_i(\theta) = \hat u_i(\theta_i) - v_i(y^*(\theta), \theta_i),$$

where $\hat u_i(\theta_i)$ is the utility of agent $i$ of type $\theta_i$ if he withdraws from the mechanism.
This makes all types' IRs bind (both interim and ex post). Then we can construct the
interim-equivalent BIC choice rule according to the above Proposition.

One problem with this argument is that when $\Delta$ is large enough, lying will also
constitute a BNE. But this problem can actually be avoided, because it turns out that the
principal can fully extract surplus even while preserving DIC.

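Proposition 15.10.5 For any DIC choice rule $(y(\cdot), t_1(\cdot), t_2(\cdot))$ there exists
another DIC choice rule $(y(\cdot), \tilde t_1(\cdot), \tilde t_2(\cdot))$ in which the interim IRs bind.

Proof. Construct the new transfers as

$$\tilde t_i(\theta) = t_i(\theta) + h_i(\theta_{-i}).$$

Since $h_i$ does not depend on agent $i$'s own report, DIC is preserved. Let $U_i(\theta)$ denote agent $i$'s
utility under the original choice rule. To make agent $i$'s interim IR bind, we need

$$E_{\theta_{-i}}[U_i(\theta_i,\theta_{-i}) + h_i(\theta_{-i}) \mid \theta_i] = \hat u_i(\theta_i).$$

Writing this out for both types, we have

$$\pi_{-i}(L \mid L)\,h_i(L) + \pi_{-i}(H \mid L)\,h_i(H) = \hat u_i(L) - E_{\theta_{-i}}[U_i(L,\theta_{-i}) \mid L],$$
$$\pi_{-i}(L \mid H)\,h_i(L) + \pi_{-i}(H \mid H)\,h_i(H) = \hat u_i(H) - E_{\theta_{-i}}[U_i(H,\theta_{-i}) \mid H].$$

This is a system of linear equations, whose determinant is

$$\pi_{-i}(L \mid L)\,\pi_{-i}(H \mid H) - \pi_{-i}(H \mid L)\,\pi_{-i}(L \mid H) = \frac{\pi(L,L)\,\pi(H,H) - \pi(L,H)\,\pi(H,L)}{\pi_i(L)\,\pi_i(H)} > 0$$

by the correlation assumption, therefore a solution exists.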

In particular, the principal can achieve the first-best by using a VCG mechanism and 
extracting all surplus from the agents. 
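
The linear system in the proof of Proposition 15.10.5 can be solved explicitly by Cramer's rule. The sketch below does this for agent 1 under a hypothetical joint distribution; the function name, the distribution, and the right-hand-side "gaps" (reservation utility minus interim expected utility for each type) are illustrative assumptions, not values from the text.

```python
def solve_side_payments(pi, gap_L, gap_H):
    """Solve for (h(L), h(H)) making agent 1's interim IRs bind.

    pi maps (theta_1, theta_2) to a joint probability; gap_L and gap_H are
    the right-hand sides of the two equations (hypothetical inputs).
    """
    m_L = pi[("L", "L")] + pi[("L", "H")]          # marginal Pr(theta_1 = L)
    m_H = pi[("H", "L")] + pi[("H", "H")]          # marginal Pr(theta_1 = H)
    a, b = pi[("L", "L")] / m_L, pi[("L", "H")] / m_L   # Pr(L | L), Pr(H | L)
    c, d = pi[("H", "L")] / m_H, pi[("H", "H")] / m_H   # Pr(L | H), Pr(H | H)
    det = a * d - b * c                             # nonzero iff types are correlated
    if det == 0:
        raise ValueError("independent types: the system may have no solution")
    h_L = (gap_L * d - b * gap_H) / det
    h_H = (a * gap_H - c * gap_L) / det
    return h_L, h_H
```

With independent types the two rows of the system coincide, the determinant vanishes, and the function raises an error, which mirrors the role of the correlation assumption in the proof.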

Cremer and McLean formulate conditions for BNE and DS implementation for more 
than two agents and types. The condition for DS implementation is proved stronger than 
the one for BNE implementation, but both hold for generic joint distributions of types. 

15.10.6 Ex Post Implementation 

Standard auction theory assumes the seller knows the distribution F from which the
buyers' valuations are drawn. For instance, consider a multi-unit auction in which each
buyer can buy at most 1 unit. The optimal auction then sells to buyers in order of their
valuations while the virtual valuation $\nu(\theta_i)$ exceeds the marginal cost $c(X) - c(X-1)$.
This can be represented by a supply curve $\nu^{-1}(c(X) - c(X-1))$. In the linear-cost case,
when the marginal cost is a constant $c$, there is no reason to use an auction: the seller can simply post
a price $\nu^{-1}(c)$. (The same is true if buyers are small and aggregate demand and hence
equilibrium marginal cost are predictable.)

But if the distribution F is not known, the monopolist does not have a demand curve.
That is, suppose that buyers' valuations are i.i.d. draws from an unknown distribution F over
which the designer has a Bayesian prior. This makes buyers' types correlated. What
mechanism should be used? We now restrict attention to ex post implementation. The optimal
mechanism maximizes expected virtual surplus subject to the relevant monotonicity constraint:

$$\max_{y(\cdot)} \ E_\theta\Bigl[\sum_i \nu(\theta_i \mid \theta_{-i})\, y_i(\theta) - c\Bigl(\sum_i y_i(\theta)\Bigr)\Bigr]$$
$$\text{s.t. } y_i(\theta_i,\theta_{-i}) \text{ is nondecreasing in } \theta_i, \ \forall i \ge 1. \quad \text{(DM)}$$

Here $\nu(\theta_i \mid \theta_{-i}) = \theta_i - \frac{1 - F(\theta_i \mid \theta_{-i})}{f(\theta_i \mid \theta_{-i})}$ is buyer $i$'s conditional virtual valuation/marginal revenue
function (by symmetry it is the same for all buyers).

For simplicity focus on the case of constant marginal cost $c$. Then if we ignore (DM)
the optimal mechanism sells to buyer $i$ iff $\nu(\theta_i \mid \theta_{-i}) > c$. If the virtual utility $\nu(\theta_i \mid \theta_{-i})$ is
nondecreasing in $\theta_i$, then the mechanism will in fact satisfy (DM) and can be implemented
by offering each buyer the optimal monopoly price $p(\theta_{-i})$ given information gleaned from
others' reports and selling iff his announced valuation is above the price. (Note that
agents make announcements first and then prices are determined.) As the number of
agents $n \to \infty$, $p(\theta_{-i})$ will converge to the optimal monopoly price by the law of large
numbers, and so the seller's profit will converge to what he would get if he knew F.

Note that the mechanism is fundamentally different from traditional auctions: here a
bid of buyer $i$ affects the allocation to other bidders directly rather than only through buyer
$i$'s allocation. This effect works through the "informational linkage" among buyers, rather
than the "technological linkage" in the standard auction through the seller's cost function
or capacity constraint.

15.10.7 Ex post Implementation with Common Values 

In the quasi-linear environment with common values, agent $i$'s payoff is $v_i(x, \theta) + t_i$, which
depends directly on the other agents' types $\theta_{-i}$.

The problem of correlation comes up again. We can have a setup of independent
common values, such as the "wallet game" where all agents care about $\sum_i \theta_i$. But the more
natural model is the "mineral rights" model, in which types are i.i.d. signals conditional on
the unknown true state, and thus correlated a priori. Most of the literature on common-value
auctions is "positive": it solves for an equilibrium and compares several practical
mechanisms, rather than looking for an optimal mechanism. This research was spurred
by Milgrom and Weber. Recently, there has been new work on optimal design restricting
attention to "ex post implementation", which imposes the following "ex post IC":

$$v_i(x(\theta), \theta) + t_i(\theta_i,\theta_{-i}) \ge v_i(x(\hat\theta_i,\theta_{-i}), \theta) + t_i(\hat\theta_i,\theta_{-i}), \quad \forall i, \ \forall \theta_i, \hat\theta_i \in \Theta_i, \ \forall \theta_{-i} \in \Theta_{-i}.$$

This means that truth-telling is optimal for agent $i$ regardless of what types other
agents have, but assuming they follow their equilibrium strategy, i.e., report truthfully.
Without the assumption, the other agents' reports could be $\theta'_{-i}$ while their true types are
$\theta_{-i}$, in which case agent $i$'s utility would be $v_i(x(\theta_i,\theta'_{-i}), \theta_i, \theta_{-i}) + t_i(\theta_i,\theta'_{-i})$, and it would
be impossible to ensure that truth-telling is always optimal for agent $i$. Note that the
assumption is not needed with private values, in which case ex post IC coincides with DIC.
In addition, ex post implementation imposes ex post IR. See Dasgupta and Maskin (QJE
2000).

Chapter 16

Dynamic Mechanism Design 



Chapters 13 - 15 deal only with static or one-shot contracting situations. In practice, 
many relations are repeated or long term. In this chapter we will study dynamic contract 
theory. 

A number of new fundamental economic issues arise when the parties are involved in 
a long-term contractual relation. How is private information revealed over time? How is 
the constrained efficiency of contractual outcomes affected by repeated interactions? How 
does the possibility of renegotiation limit the efficiency of the overall long-term contract? 
To what extent can reputation serve as a more informal enforcement vehicle that is an 
alternative to courts? We discuss some of these issues in this chapter. 

We will study long-term incentive contracting in a dynamic principal-agent setting
with one agent and adverse selection. We first assume that the principal (designer) can
commit to a contract forever, and then consider what happens when she cannot commit
against modifying the contract as new information arrives.

16.1 Dynamic Contracts with Commitment 

Methodologically, there is no significant change in analyzing optimal multiperiod contracts 
as long as the contracting parties can commit to a single comprehensive long-term contract 
at the initial negotiation stage. When full commitment is feasible, the long-term contract 
can essentially be reduced to a slightly more complex static contract involving trade of 
a slightly richer basket of state-contingent commodities, services, and transfers. What
this conclusion implies in particular for contracting under hidden information is that the
revelation principle still applies under full commitment.

16.1.1 Dynamic P-A Model with Constant Type 

For this purpose, we consider a principal-agent relationship that is repeated over time,
over a finite or infinite horizon. First, as a benchmark, we suppose that the agent's type
$\theta$ is realized ex ante and is then constant over time.

An outcome in this model can be described as a sequence of decisions and payments
$(x_t, y_t)_{t=0}^{\infty}$. Both parties have the same discount factor $0 < \delta < 1$ (in a finite-horizon
model, discounting does not matter). Suppose the payoffs are stationary:

Principal: $\sum_{t=0}^{\infty} \delta^t [y_t - c(x_t)]$,
Agent: $\sum_{t=0}^{\infty} \delta^t [v(x_t, \theta) - y_t]$.

Suppose the Principal can commit ex ante to a mechanism, which cannot be modified
later on. This can be understood as a standard mechanism design setting in which the
outcome is the whole sequence of decisions and payments. By the Revelation Principle,
we can restrict attention to direct revelation mechanisms, in which the agent announces
his type $\theta$, and the mechanism implements a sequence $(x_t(\theta), y_t(\theta))_{t=0}^{\infty}$ according to his
announcement.

Does the principal need to use mechanisms that are not stationary, i.e., specify different
outcomes in different periods? The answer is no: a non-stationary outcome can be replaced by a
stationary randomization (lottery) that yields, in each period, each $(x_t, y_t)$ with probability
proportional to $\delta^t$ (that is, $(1-\delta)\delta^t$ with an infinite horizon). Then the parties' expected
utilities will be the same as from the original non-stationary outcome. Thus, in the stationary
setting with constant types and commitment, the optimal contract is a repetition
of the optimal one-period (possibly randomized) contract.
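
The equivalence between a non-stationary outcome and a stationary lottery can be verified with a short calculation. The sketch below assumes a finite horizon and lottery weights proportional to δ^t (an assumption chosen so that the weights sum to one); `flows` stands for one party's per-period utilities under the original outcome, and all names are illustrative.

```python
def discounted_sum(flows, delta):
    """Discounted utility of the original (possibly non-stationary) outcome."""
    return sum(delta ** t * u for t, u in enumerate(flows))


def stationary_lottery_value(flows, delta):
    """Discounted utility when the same lottery over period outcomes is drawn
    in every period, with weight on outcome t proportional to delta**t."""
    weights = [delta ** t for t in range(len(flows))]
    total = sum(weights)
    per_period = sum(w / total * u for w, u in zip(weights, flows))
    # The same expected per-period utility accrues in each of the periods.
    return total * per_period
```

Because expected utility is linear in probabilities, the same check goes through for the principal and for every type of the agent, so incentive constraints are unaffected by the replacement.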

Under some conditions, the principal will not resort to lotteries. A simple case is where
the functions $c(\cdot)$ and $v(\cdot,\theta)$ are linear in $x$: in this case the randomized outcome $(x, y)$
can be replaced with a deterministic outcome $(Ex, Ey)$, which gives both parties exactly
the same utilities. More generally, a set of sufficient conditions for the principal not to resort
to randomizations is given by $c''(\cdot) \ge 0$, $v_{xx}(\cdot) \le 0$, and $v_{\theta xx}(\cdot) \ge 0$ (see Fudenberg-Tirole
pp. 306-7).

16.1.2 Dynamic P-A Model with Markov Types

Suppose now the type can change: let $\theta_t$ denote the type in period $t$, and suppose that types
follow a Markov process. This captures many dynamic contracting situations in which
agents learn information over time, such as:

• Consumers buy calling plans and then learn their valuations for calls on a given day. 

• Travelers buy tickets and then get new information about their valuation for the 
trip (e.g., sickness, busy schedule, etc.) 

• A buyer joins a wholesale club for a fee and then learns his valuations for goods. 

The payoffs are now:

Principal: $\sum_{t=0}^{\infty} \delta^t [y_t - c(x_t)]$,
Agent: $\sum_{t=0}^{\infty} \delta^t [v_t(x_t, \theta_t) - y_t]$.

We now allow for non-stationary payoffs (and in particular a finite horizon, or
even the case in which trade does not occur until some period T even though information
arrives earlier and the agent sends messages).

The model has a "multidimensional type" (even though with the restriction that the
agent cannot report a type before he observes it), and so it is quite complicated.

How do optimal screening mechanisms look in such situations? We can still use a
version of the Revelation Principle for dynamic games to restrict attention to mechanisms
in which the Agent reports new information as it arrives and reports truthfully
in equilibrium. The mechanism then specifies, for each $t$, $x_t(\theta^t)$ and $y_t(\theta^t)$, where
$\theta^t = (\theta_0, \ldots, \theta_t)$ is the reporting history at time $t$.

Remark 16.1.1 The agent's complete strategy specifies what to report if he lied in the
past, but the Markovian assumption means that his true past is no longer payoff-relevant
(even if his past reports may be), so if he finds it optimal to report truthfully after
truthful reporting he will also find it optimal to report truthfully after misreporting. Using
the "one-stage deviation principle", we will check the IC that the agent does not want to
deviate to lying about $\theta_t$ for any history $\theta^{t-1}$.

Let $U_t(\theta^t)$ denote the continuation expected utility of type $\theta_t$ in period $t$ from truthful
announcement for reporting history $\theta^{t-1}$.

Consider first the IRs. It is easy to see that even if we require participation constraints
in all periods (i.e., $U_t(\theta^t) \ge 0$, $\forall t, \theta^t$), only zero-period IRs can bind: any other IR in
period $t \ge 1$ could be relaxed by "bonding" the agent, i.e., charging him a large payment
M in period zero and returning it in period t + 1. (There are real-life examples of such
bonding: e.g., long-term life insurance contracts, which consumers can cancel at any time,
have premiums higher than risk in the beginning of the contract, in exchange for premiums
below risk later in life. This ensures that consumers who find out that they are healthy
and likely to live long don't cancel the contract.)

Since only the first-period (i.e., zero-period) IR may bind, what determines distortion is
extraction by the principal of the agent's zero-period information rent $U_0(\theta_0)$.

To calculate this rent, suppose that all $\theta_t \in [\underline\theta, \bar\theta]$, the Markov process is described by
a smooth cdf $F_t(\theta_t \mid \theta_{t-1})$ with density $f_t(\theta_t \mid \theta_{t-1})$, and $v_t$ has a bounded derivative in $\theta_t$. Then we can get ICs for
1-stage deviations from truth-telling as follows:



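$$U_t(\theta^t) = \max_{\hat\theta_t} \Bigl[ v_t(x_t(\theta^{t-1},\hat\theta_t),\theta_t) - y_t(\theta^{t-1},\hat\theta_t) + \delta \int U_{t+1}(\theta^{t-1},\hat\theta_t,\theta_{t+1})\, dF_{t+1}(\theta_{t+1} \mid \theta_t) \Bigr].$$

Applying the Envelope Theorem yields (a.e.):

$$\frac{\partial U_t(\theta^t)}{\partial\theta_t} = \frac{\partial v_t(x_t(\theta^t),\theta_t)}{\partial\theta_t} + \delta \int U_{t+1}(\theta^{t+1})\, \frac{\partial f_{t+1}(\theta_{t+1} \mid \theta_t)}{\partial\theta_t}\, d\theta_{t+1}$$
$$= \frac{\partial v_t(x_t(\theta^t),\theta_t)}{\partial\theta_t} - \delta\, E\Bigl[ \frac{\partial U_{t+1}(\theta^{t+1})}{\partial\theta_{t+1}} \cdot \frac{\partial F_{t+1}(\theta_{t+1} \mid \theta_t)/\partial\theta_t}{f_{t+1}(\theta_{t+1} \mid \theta_t)} \Bigm| \theta_t \Bigr]$$

(where the 2nd equality obtains by integration by parts using the fact that $\partial F_{t+1}(\theta_{t+1} \mid \theta_t)/\partial\theta_t = 0$
for $\theta_{t+1} = \underline\theta$ and $\theta_{t+1} = \bar\theta$).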

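So, working backward from the infinite horizon and applying iterated expectations,
we get

$$U_0'(\theta_0) = E\Bigl[ \sum_{t=0}^{\infty} \delta^t \prod_{\tau=1}^{t} \Bigl( -\frac{\partial F_\tau(\theta_\tau \mid \theta_{\tau-1})/\partial\theta_{\tau-1}}{f_\tau(\theta_\tau \mid \theta_{\tau-1})} \Bigr) \frac{\partial v_t(x_t(\theta^t),\theta_t)}{\partial\theta_t} \Bigm| \theta_0 \Bigr].$$

We need to figure out which of the first-period (zero-period) IRs binds. Assume FOSD: $\partial F_t(\theta_t \mid \theta_{t-1})/\partial\theta_{t-1} \le 0$.
Then $\partial U_0(\theta_0)/\partial\theta_0 \ge 0$, and so only the IR of the lowest type $\underline\theta$ binds. Intuitively, the distortion
will then be in the direction of reducing $U_0'(\theta_0)$, since this reduces all types' information
rents. Under FOSD, this is achieved by reducing the derivative $\partial v_t(x_t(\theta^t),\theta_t)/\partial\theta_t$, which under
SCP is achieved by reducing $x_t(\theta^t)$. To see this more formally, we can write virtual surplus
in period $t$ in state $\theta^t$ as

$$v_t(x_t,\theta_t) - c(x_t) - \frac{1 - F(\theta_0)}{f(\theta_0)} \prod_{\tau=1}^{t} \Bigl( -\frac{\partial F_\tau(\theta_\tau \mid \theta_{\tau-1})/\partial\theta_{\tau-1}}{f_\tau(\theta_\tau \mid \theta_{\tau-1})} \Bigr) \frac{\partial v_t(x_t,\theta_t)}{\partial\theta_t},$$

where $F$ and $f$ denote the cdf and density of the initial type $\theta_0$.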

The relaxed problem maximizes this expression in all periods and states. Properties
of the solution to the relaxed problem:

• Downward distortion in each period.

• For $\theta_0 = \bar\theta$, no distortions forever.

• For $t \ge 1$, if $\theta_t = \underline\theta$ or $\theta_t = \bar\theta$, then no distortions from then on (since then
$\partial F_t(\theta_t \mid \theta_{t-1})/\partial\theta_{t-1} = 0$).

Does this solve the full problem, i.e., are all ICs satisfied? In a 2-period model with linear
utility and FOSD, Courty-Li show that all ICs are satisfied when $x_t(\theta^t)$ is nondecreasing
in all arguments. This holds, for example, when $\theta_t = a^{-1}(b(\theta_{t-1}) + \varepsilon_t)$ (including e.g. the
additive and multiplicative cases): then $F(\theta_t \mid \theta_{t-1}) = G(a(\theta_t) - b(\theta_{t-1}))$, where $G$ is the cdf of $\varepsilon_t$, and so
$F_{\theta_{t-1}}(\theta_t \mid \theta_{t-1})/f(\theta_t \mid \theta_{t-1}) = -b'(\theta_{t-1})/a'(\theta_t)$. So when $b(\cdot)$ is increasing and convex and $a(\cdot)$ is
increasing and concave we are OK.
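
As a worked step for the ratio above, the derivation below spells out the differentiation and then specializes to an assumed linear example, $a(\theta) = \theta$ and $b(\theta) = \gamma\theta$ with a hypothetical persistence parameter $\gamma \in [0,1]$ (this example is not in the original, and boundary effects of the bounded type space are ignored in the illustration). Here $g$ denotes the density of $\varepsilon_t$.

```latex
\begin{align*}
F(\theta_t \mid \theta_{t-1}) &= G\bigl(a(\theta_t) - b(\theta_{t-1})\bigr), \\
\frac{\partial F(\theta_t \mid \theta_{t-1})}{\partial \theta_{t-1}}
  &= -\,g\bigl(a(\theta_t) - b(\theta_{t-1})\bigr)\, b'(\theta_{t-1}), \\
f(\theta_t \mid \theta_{t-1})
  &= g\bigl(a(\theta_t) - b(\theta_{t-1})\bigr)\, a'(\theta_t), \\
\frac{F_{\theta_{t-1}}(\theta_t \mid \theta_{t-1})}{f(\theta_t \mid \theta_{t-1})}
  &= -\,\frac{b'(\theta_{t-1})}{a'(\theta_t)}.
\end{align*}
% Assumed example: theta_t = gamma * theta_{t-1} + eps_t, so a' = 1 and b' = gamma.
\begin{align*}
-\,\frac{F_{\theta_{t-1}}(\theta_t \mid \theta_{t-1})}{f(\theta_t \mid \theta_{t-1})} = \gamma,
\qquad
\prod_{\tau=1}^{t} \Bigl(-\,\frac{F_{\theta_{\tau-1}}(\theta_\tau \mid \theta_{\tau-1})}{f(\theta_\tau \mid \theta_{\tau-1})}\Bigr) = \gamma^{t}.
\end{align*}
```

In this assumed example the product of the ratios over $t$ periods is $\gamma^t$, which shrinks geometrically when $\gamma < 1$: the initial type matters less and less for later types.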

For a discussion of the dynamic multi-agent model, see Athey and Segal (2007).



16.2 Dynamic Contracts without Full Commitment 

When the contracting parties are allowed to renegotiate the initial contract as time unfolds 
and new information arrives, new conceptual issues need to be addressed and the basic 
methodology of optimal static contracting must be adapted. Mainly, incentive constraints 
must then be replaced by tighter renegotiation-proofness constraints. 

16.2.1 Basic Problem 

Now we examine the temptations to modify the original contract and how they affect the
outcome with imperfect commitment. For simplicity we focus on a 2-period P-A model
with a constant type and stationary payoffs. That is, the payoffs are

Principal: $y_1 - c(x_1) + \delta[y_2 - c(x_2)]$,
Agent: $v(x_1,\theta) - y_1 + \delta[v(x_2,\theta) - y_2]$.

The agent's type $\theta$ is realized before the relationship starts and is the same in both
periods. Suppose the type space is $\Theta = \{L,H\}$, and let $\Pr(\theta = H) = \mu \in (0,1)$. We
will allow $\delta$ to be smaller or greater than one; the latter can be interpreted as capturing
situations in which the second period is "longer" than the first period (we avoid analyzing
more than 2 periods, since the model proves complicated enough as it is).

The outcome of contracting in this model depends on the degree of the principal's
ability to commit not to modify the contract after the first period.

If the principal cannot commit, she will be tempted to modify the optimal
commitment contract using the information revealed by the agent in the first period.
Recall that the optimal commitment contract is a repetition of the optimal static contract.

There are two reasons for the principal to modify the contract: 

1. The high type receives information rents in the optimal contract (point H). However,
after the first period, the high type has already revealed himself. Then the
principal can extract all the information rents from the high type by offering point
H' instead. This is called the Ratchet Problem. Intuitively, once the agent has revealed
that he has a high value for consumption, the principal charges more. If
the high type anticipates this modification ex ante, his ex ante incentive constraint
becomes violated: he would rather pretend to be the low type in the first period by
choosing point L. This will not change his first-period payoff (since $(IC_{HL})$ binds),
but allows him to keep information rents in the 2nd period. Examples of ratchet
problems: labor contracts, regulation of natural monopolies, etc.

2. After a low type reveals himself by choosing point L, the principal can raise her
second-period payoff by eliminating the downward distortion in the low type's consumption
in the second period. Namely, instead of point L she will offer the low
type the efficient point L'. This is called the renegotiation problem, because this
modification can be done with the agreement of the low type. (In fact, the low type
buyer will be indifferent between L and L', but the principal can make him strictly
prefer the change by asking for $\varepsilon$ less in the transfer.) The renegotiation problem
was first considered by Dewatripont (1988, 1989).

If the high type anticipates this modification, in the first period he will pretend
to be the low type by choosing point L instead of point H, and then receive point
L' in the second period. This will not change his first-period payoff (since $(IC_{HL})$
binds), but will increase his second-period payoff, since L' gives him a higher payoff
than H.

Examples of renegotiation problems: soft budget constraints.

Both these ex post modifications improve the principal's payoff ex post; however, they
destroy the ex ante incentive-compatibility of the contract.

Observe that in an equilibrium without the principal's commitment, the principal
cannot do better than if she could commit. Indeed, any choice rule that results in an
equilibrium without commitment will satisfy the agent's IC and IR constraints, therefore
it can also be implemented by an IC and IR commitment mechanism. The reverse is not
true: the principal's inability to commit will limit the set of implementable choice rules
and reduce her expected ex ante profits.

To what extent can the principal commit not to modify the contract and avoid the 
ratchet and renegotiation problems? The literature has considered three degrees of com- 
mitment: 

1. Full Commitment: The principal can commit to any contract ex-ante. We have 
already considered this case. The Revelation Principle works, and we obtain a 
simple replication of the static model. 

2. Long Term Renegotiable Contracts: The principal cannot modify the contract 
unilaterally, but the contract can be renegotiated if both the principal and the agent 
agree to it. Thus, the principal cannot make any modifications that make the agent 
worse off, but can make modifications that make the agent better off. Thus, the 
ratchet problem does not arise in this setting (the high type would not accept a 
modification making him worse off) but the renegotiation problem does arise. 

3. Short-Term Contracts: The principal can modify the contract unilaterally after
the first period. This means that the ex ante contract has no effect in the second
period, i.e., it can only specify the first-period outcome, and after the first period
the parties contract on the second-period outcome. This setting gives rise to both
the ratchet problem and the renegotiation problem.

A key feature of the cases without commitment is that when $\delta$ is sufficiently high, the
principal will no longer want the agent to reveal his type fully in the first period: she
prefers to commit herself against contract modification by having less information at the
modification stage. For example, if the two types pool in the first period, the principal can
implement the optimal static contract in the second period, and the problems of ratcheting
and renegotiation do not arise. When $\delta$ is very high, this will indeed be optimal for the
principal. For intermediate levels of $\delta$, the principal will prefer to have partial revelation
of information in the first period, which allows her to commit herself against modification.
This partial revelation will be achieved by the agent using a mixed strategy, revealing his
type with some probability and pooling with some probability.

16.2.2 Full Analysis of A Simple Model 

Since the analysis of models without full commitment is quite difficult, we will study the
simplest possible model in which the ratchet and renegotiation problems arise. In the
model, based on Hart and Tirole (1990), the principal is a monopolistic seller who can
produce an indivisible object in each period at zero cost. The agent's valuation for
this object in each period is $\theta$, thus his per-period utility is $\theta x - t$, where $x \in \{0, 1\}$
indicates whether he receives the object, and $\theta \in \{L, H\}$ is his privately observed type,
with $0 < L < H$ and $\Pr(\theta = H) = \mu \in (0, 1)$. This is a particular case of the two-type
principal-agent model considered before.

The optimal static contract

The simplest way to solve for the optimal static contract is using the taxation principle.
Suppose the principal offers a tariff $T : \{0, 1\} \to \mathbb{R}$. Since the agent can reject the tariff,
we can assume that $T(0) = 0$. Thus, we can look for the optimal $T(1) = p$, the price for the
object.

Faced with the tariff, the agent solves $\max_{x \in \{0,1\}} (\theta x - px)$. Thus, he chooses $x(\theta) = 1$
if $\theta \ge p$ and $x(\theta) = 0$ if $\theta < p$. Let $\bar x = \mu x(H) + (1 - \mu) x(L)$ be the expected demand faced
by the principal. Then (assuming the principal can convince the agent to buy when he is
indifferent) we have

$$\bar x(p) = \begin{cases} 0 & \text{if } p > H, \\ \mu & \text{if } p \in (L, H], \\ 1 & \text{if } p \le L. \end{cases}$$

The principal will choose $p$ to maximize expected revenue $p\,\bar x(p)$. There are only two
potentially optimal alternatives: setting $p = L$ and selling to both types yields expected
revenue $L$, and setting $p = H$ and selling only to the high type yields expected revenue
$\mu H$. Thus, the principal's optimal price is

$$p^*(\mu) = \begin{cases} L & \text{if } \mu < L/H, \\ H & \text{if } \mu > L/H, \\ \text{either } L \text{ or } H & \text{if } \mu = L/H. \end{cases}$$

Intuitively, if there are sufficiently many high types, the principal prefers to target only
them by setting price H and making the sale with probability $\mu$, but if there are few high
types the principal sets price L and sells with probability 1. (Exercise: obtain the same
result using the Revelation Principle instead of the Taxation Principle.)

Note that when $p^*(\mu) = L$, both types consume efficiently, but the high type receives
information rents. When $p^*(\mu) = H$, on the other hand, the low type underconsumes, but
the high type receives no information rent. The slight difference from the basic principal-agent
model is that the agent's consumption choices are never interior, so the high type's
information rents and the low type's underconsumption need not always be strictly
positive. However, the basic trade-off between efficiency of the low type's consumption
and extraction of the high type's information rents is the same as in the basic model.

Dynamics with Full Commitment

Suppose now that there are two periods. We already know that any two-period contract
can be replaced with a repetition of the same (possibly randomized) static contract. In
our case, the optimal static contract is setting the price $p^*(\mu)$. (Since the parties'
payoffs are linear in both $x$ and $t$, randomizations need not be used.) Therefore, the optimal
commitment contract sets the same price $p^*(\mu)$ in both periods.

If the principal lacks full commitment, will she want to modify this contract after the
first period? This depends on $p^*(\mu)$. If $\mu < L/H$ and therefore $p^*(\mu) = L$, then the two
types pool in the first period, and the principal receives no information. Then she has no
incentive to modify the same optimal static contract in the second period.
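
The static pricing rule that the commitment contract repeats can be sketched in a few lines. The function names are illustrative, the tie at μ = L/H is broken toward L purely by convention in this sketch, and the two-period profit assumes the same price is charged in both periods with discount factor δ.

```python
def expected_demand(p, mu, L, H):
    """Probability of a sale at price p (the agent buys when indifferent)."""
    if p > H:
        return 0.0
    if p > L:
        return mu        # only the high type buys
    return 1.0           # both types buy


def optimal_static_price(mu, L, H):
    """Compare revenue L (sell to both) with mu * H (sell to the high type)."""
    return H if mu * H > L else L


def commitment_profit(mu, L, H, delta):
    """Two-period profit from repeating the optimal static price."""
    p = optimal_static_price(mu, L, H)
    return (1 + delta) * p * expected_demand(p, mu, L, H)
```

For instance, with L = 1, H = 3 and μ = 0.5 the high price is optimal because μH = 1.5 exceeds L, whereas with μ = 0.2 the seller prefers to sell to both types at L.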

However, when $\mu > L/H$ and therefore $p^*(\mu) = H$, the principal knows following a
purchase that she deals with a high type and following no purchase that she deals with
a low type. In the latter case, she will want to restore efficiency for the low type by
lowering the price to L in the second period: this is the renegotiation problem. Anticipating this price
reduction following no purchase, the high type will not buy in the first period. As for
the former case, the principal already extracts all of the high type's information rents,
so the ratcheting problem does not arise at the optimal commitment contract. We will
see, however, that when the principal deals optimally with the renegotiation problem, the
ratcheting problem may arise and further hurt the principal.

Dynamics with Short-Term Contracts 

Let $\mu > L/H$, and let the parties play the following two-period game:

• Principal offers price $p_1$.

• Agent chooses $x_1 \in \{0, 1\}$.

• Principal offers price $p_2$.

• Agent chooses $x_2 \in \{0, 1\}$.

We solve for the principal's preferred weak Perfect Bayesian Equilibrium (PBE) in this
extensive-form game of incomplete information. We do this by first considering possible
continuation PBEs following different first-period price choices $p_1$. Then the principal's
optimal PBE can be constructed by choosing the optimal continuation PBE following any
price choice $p_1$, and finding the price $p_1$ that gives rise to the best continuation PBE for
the principal.

The principal's strategy in the continuation game is a function $p_2(x_1)$ that sets the
second-period price following the agent's first-period choice $x_1$. This price should be
optimal given $\mu_2(x_1) = \Pr(\theta = H \mid x_1)$, the principal's posterior belief that the agent is
a high type after $x_1$ was chosen. From the analysis of the static model we know that the
principal's 2nd period price must satisfy

$$p_2(x_1) = p^*(\mu_2(x_1)) = \begin{cases} L & \text{if } \mu_2(x_1) < L/H, \\ H & \text{if } \mu_2(x_1) > L/H, \\ \text{either } L \text{ or } H & \text{if } \mu_2(x_1) = L/H. \end{cases}$$

"Full Revelation Equilibrium" Can we have a continuation PBE in which $x_1(H) = 1$
and $x_1(L) = 0$? In this case we have $\mu_2(0) = 0$ and $\mu_2(1) = 1$, and therefore $p_2(0) = L$
and $p_2(1) = H$. What should be the first-period price $p_1$ for this to happen? If the high
type does not buy in the first period, this reduces the price he pays in the second period.
Thus, in order to induce him to buy, the first-period price must be sufficiently low. More
precisely, we must have

$$\underbrace{H - p_1}_{\text{buy at } \tau = 1} + \underbrace{\delta(H - H)}_{\text{no rents at } \tau = 2} \ \ge \ \underbrace{0}_{\text{don't buy at } \tau = 1} + \underbrace{\delta(H - L)}_{\text{rents at } \tau = 2},$$

or

$$p_1 \le (1 - \delta)H + \delta L.$$

As for the low type, he is willing to pay exactly L in the first period (he receives
no surplus in the 2nd period in any event). Thus, when $\delta \in (0,1)$, the optimal "Full
Revelation" equilibrium has the principal selling to the high type only in the first period
at price $p_1 = (1 - \delta)H + \delta L \in (L,H)$, and the low type does not buy. The principal's
expected profit is

$$\pi_F = \mu[(1-\delta)H + \delta L + \delta H] + (1-\mu)\delta L = \mu H + \delta L.$$

However, when $\delta > 1$, the principal cannot sell to the high type in the first period
without selling to the low type, since $(1-\delta)H + \delta L < L$. Then the Principal cannot achieve
Full Revelation with short-term contracts. Intuitively, when the future is important, the
ratchet effect is so large that the high type needs to be offered a good deal to reveal
himself in the first period. But then the low type would "take the money and run". This
makes full revelation impossible.
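
The feasibility condition for full revelation can be checked with a small calculation. In the sketch below (function names are illustrative), the highest first-period price the high type accepts is (1 − δ)H + δL, and separation additionally requires this price to exceed L so that the low type does not buy; the boundary case δ = 1 is treated as infeasible by assumption.

```python
def full_revelation_price(L, H, delta):
    """Highest separating first-period price, or None if separation fails."""
    cap = (1 - delta) * H + delta * L   # high type's willingness to pay at tau = 1
    # The low type must stay out, which needs cap > L, i.e. delta < 1.
    return cap if cap > L else None


def full_revelation_profit(mu, L, H, delta):
    """Principal's expected profit in the full-revelation equilibrium."""
    p1 = full_revelation_price(L, H, delta)
    if p1 is None:
        return None
    # High type pays p1 then H; low type buys only in period 2 at price L.
    return mu * (p1 + delta * H) + (1 - mu) * delta * L
```

With L = 1, H = 3 and δ = 0.5 the separating price is 2, and the resulting profit coincides with μH + δL; with δ = 1.5 the cap falls below L and no separating price exists.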

"No Revelation Equilibrium" Another possibility for the principal is to have the 
agent always make the same choice in the first period. The principal's second-period
belief in equilibrium is then the prior $\mu$, and the second-period equilibrium price is therefore
$H$ (recall that $\mu > L/H$). Note that the low type will be willing to pay price $L$ in the first
period (he receives no surplus in the second period in any event). Thus, given that there is
no revelation, it will be optimal for the principal to sell to both types in the first period at
price $L$. Since $x_1 = 0$ is a zero-probability event, the belief $\mu_2(0)$ can be chosen arbitrarily.
In particular, we can choose $\mu_2(0) > L/H$, so that $p_2(0) = p_2(1) = H$. The high type will then
also be willing to buy at price $L$ (he would not gain anything by not buying). The principal's
expected profit in this candidate equilibrium is

$$\pi_N = L + \delta\mu H.$$

Which continuation equilibrium is better for the principal? Note that

$$\pi_F - \pi_N = (\mu H - L)(1 - \delta).$$

Recall that we now consider the case where $\mu > L/H$. In this case, when $\delta < 1$, Full
Revelation is feasible and it is better for the principal than No Revelation. On the other
hand, recall that when $\delta > 1$, Full Revelation is not feasible at all.
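
The profit difference can be verified with exact arithmetic. The following is a minimal check under assumed parameter values ($\mu = 3/4$, $L = 1$, $H = 2$); the helper names are hypothetical.

```python
from fractions import Fraction

def profit_full_revelation(mu, L, H, delta):
    return mu * H + delta * L          # pi_F

def profit_no_revelation(mu, L, H, delta):
    return L + delta * mu * H          # pi_N

mu, L, H = Fraction(3, 4), 1, 2        # assumed values with mu > L/H
for delta in (Fraction(1, 4), Fraction(1, 2), Fraction(9, 10)):
    gap = (profit_full_revelation(mu, L, H, delta)
           - profit_no_revelation(mu, L, H, delta))
    assert gap == (mu * H - L) * (1 - delta)   # the identity stated in the text
    assert gap > 0                             # Full Revelation wins for delta < 1
```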

"Partial Revelation Equilibrium" The principal may be able to do better than 
either with Full Revelation or No Revelation by letting the agent use a mixed strategy.
In such a mixed-strategy equilibrium, the agent's first-period choice gives some information
about the agent's type but does not reveal it fully. Thus, let the agent randomize in the
first period, with $\rho_\theta$ denoting the probability that the agent of type $\theta$ chooses $x_1 = 1$. We will
consider equilibria in which both first-period choices are made with positive probability,
hence both $\mu_2(0)$ and $\mu_2(1)$ are obtained by Bayes' rule (so the "No Revelation" case is
ruled out). There are four cases to consider:

1. $p_2(0) = p_2(1) = L$

2. $p_2(0) = H$, $p_2(1) = L$

3. $p_2(0) = L$, $p_2(1) = H$

4. $p_2(0) = p_2(1) = H$

We consider these four cases in turn.

Case 1 In this case we must have $\mu_2(0) \le L/H$ and $\mu_2(1) \le L/H$. This is inconsistent
with the posteriors being obtained by Bayes' rule from the prior $\mu > L/H$: the prior
is a weighted average of the two posteriors, so they cannot both lie below it.

Case 2 This case is also infeasible, though the reasoning is more complex. Note that the
payoff of the high type can be written as $\phi(\rho, H) = [H - p_1 + \delta(H - L)]\rho$, while the
payoff of the low type can be written as $\phi(\rho, L) = (L - p_1)\rho$ (the low type receives
zero surplus in the second period). Note that $\partial\phi(\rho, H)/\partial\rho > \partial\phi(\rho, L)/\partial\rho$, hence
$\phi(\rho, \theta)$ has SCP. Intuitively, the high type in a Case 2 equilibrium would be more
eager to buy in the first period since he values the object more and cares more about
a low price in the second period. By MCS Theorem ?? (a), we must have $\rho_H \ge \rho_L$ in
equilibrium. But then by Bayes' rule we must have $\mu_2(1) \ge \mu > L/H$, and therefore
$p_2(1) = H$, a contradiction. Therefore, Case 2 equilibria do not exist.

Case 3 In this case we must have $\mu_2(0) \le L/H$ and $\mu_2(1) \ge L/H$. It can be seen that in
this case the optimal equilibrium for the principal is the Full Revelation Equilibrium
considered above.

Case 4 In this case, we must have $\mu_2(0) \ge L/H$ and $\mu_2(1) \ge L/H$. For the former we
must have $\rho_H < 1$, i.e., some high types not buying in the first period. Note that
since the agent's first-period choice does not affect the second-period price, the high
type is willing to pay $H$ and the low type is willing to pay $L$ in the first period.
Thus, for $\rho_H < 1$ we must have $p_1 \ge H$. But then the low type will not buy in the
first period, i.e., $\rho_L = 0$.

It will be optimal for the principal to set $p_1 = H$ and maximize $\rho_H$ subject to $\mu_2(0) \ge
L/H$. This gives a "partial revelation" equilibrium. In this equilibrium the principal
will maximize the probability of separation of the high type subject to there remaining
sufficiently many unrevealed high types so that she can credibly promise not to lower the
price. By Bayes' rule,

$$\mu_2(0) = \frac{\mu(1 - \rho_H)}{\mu(1 - \rho_H) + (1 - \mu)},$$

since $\rho_L = 0$. The inequality $\mu_2(0) \ge L/H$ is equivalent to

$$\rho_H \le \rho^* = \frac{\mu H - L}{\mu(H - L)}.$$

(It is easy to see that $\rho^* < 1$.) The principal maximizes expected profit by setting $\rho_H = \rho^*$
in this candidate equilibrium, which yields profit

$$\pi_P = \mu[\rho_H H + \delta H] = \left[\frac{\mu H - L}{H - L} + \mu\delta\right] H.$$
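
The Bayes' rule step and the resulting bound on the mixing probability can be checked numerically. The sketch below is an illustration under assumed values ($\mu = 3/4$, $L = 1$, $H = 2$); the function names are hypothetical and the symbol `rho` stands for the high type's first-period purchase probability.

```python
from fractions import Fraction

def posterior_after_no_purchase(mu, rho_H, rho_L=0):
    """Pr(theta = H | x1 = 0) by Bayes' rule."""
    high = mu * (1 - rho_H)            # high types who did not buy
    low = (1 - mu) * (1 - rho_L)       # low types who did not buy
    return high / (high + low)

def rho_star(mu, L, H):
    """Largest rho_H that keeps the posterior after x1 = 0 at or above L/H."""
    return (mu * H - L) / (mu * (H - L))

def profit_partial_revelation(mu, L, H, delta):
    return mu * (rho_star(mu, L, H) * H + delta * H)    # pi_P

mu, L, H = Fraction(3, 4), 1, 2        # assumed values with mu > L/H
r = rho_star(mu, L, H)
assert 0 < r < 1
# At rho_H = rho*, the constraint mu_2(0) >= L/H holds with equality.
assert posterior_after_no_purchase(mu, r) == Fraction(L, H)
```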

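Optimal PBE When $\delta < 1$, Full Revelation is feasible, and $\pi_F \ge \pi_N$. It remains to
compare $\pi_F$ to $\pi_P$, the profit from Partial Revelation:

$$
\begin{aligned}
\pi_P \ge \pi_F
&\iff \mu\rho^* H + \mu\delta H \ge \mu H + \delta L \\
&\iff \mu(\delta + \rho^* - 1) \ge \delta\frac{L}{H} \\
&\iff \mu(\delta H - \delta L + L) \ge L\left[1 + \delta - \delta\frac{L}{H}\right] \\
&\iff \mu \ge \frac{L}{H} \cdot \frac{H + \delta(H - L)}{L + \delta(H - L)},
\end{aligned}
$$

where the second fraction in the last line exceeds 1, so this threshold lies strictly above $L/H$.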

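
The threshold on the prior can be sanity-checked with exact rationals. This is a sketch under assumed values ($L = 1$, $H = 2$, $\delta = 1/2$); the helper names `mu_threshold`, `profit_full`, and `profit_partial` are hypothetical.

```python
from fractions import Fraction

def mu_threshold(L, H, delta):
    """Prior at which Partial and Full Revelation yield the same profit."""
    ratio = Fraction(L) / Fraction(H)
    return ratio * (H + delta * (H - L)) / (L + delta * (H - L))

def profit_full(mu, L, H, delta):
    return mu * H + delta * L                      # pi_F

def profit_partial(mu, L, H, delta):
    rho = (mu * H - L) / (mu * (H - L))            # rho*
    return mu * (rho * H + delta * H)              # pi_P

L, H, delta = 1, 2, Fraction(1, 2)                 # assumed values, delta < 1
m = mu_threshold(L, H, delta)
eps = Fraction(1, 100)

assert m > Fraction(L, H)                          # cutoff lies above L/H
assert profit_partial(m, L, H, delta) == profit_full(m, L, H, delta)
assert profit_partial(m + eps, L, H, delta) > profit_full(m + eps, L, H, delta)
assert profit_partial(m - eps, L, H, delta) < profit_full(m - eps, L, H, delta)
```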


Conclusion for ST contracts when $\delta < 1$:

1. If $\mu \le L/H$, then P implements the full commitment optimum with $p_1 = p_2 = L$,
and there is no revelation of information.

2. If $L/H < \mu \le \frac{L}{H} \cdot \frac{H + \delta(H - L)}{L + \delta(H - L)}$, then we get the "Full Revelation" equilibrium:

$p_1 = (1 - \delta)H + \delta L$,
$p_2(1) = H$ and $p_2(0) = L$.

• The H type buys in both periods.

• The L type buys only in the second period.

3. If $\mu \ge \frac{L}{H} \cdot \frac{H + \delta(H - L)}{L + \delta(H - L)}$, then we get the "Partial Revelation" equilibrium:

$p_1 = p_2 = H$.

• The H type buys in the first period with probability $\rho^* = \frac{\mu H - L}{\mu(H - L)} < 1$, and
buys for sure in the second period.

• The L type never buys.

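Conversely, when $\delta > 1$ we know that Full Revelation is infeasible. It remains to
compare $\pi_N$ and $\pi_P$:

$$
\pi_P - \pi_N = [\mu\rho^* H + \mu\delta H] - [L + \mu\delta H] = \mu\rho^* H - L = \frac{\mu H - L}{H - L} H - L \ge 0
$$

$$
\iff \mu \ge \frac{L}{H} + \frac{L}{H} \cdot \frac{H - L}{H} = \frac{L}{H}\left(2 - \frac{L}{H}\right).
$$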


Conclusion for ST contracts when $\delta > 1$:

1. If $\mu \le L/H$, then P implements the full commitment optimum with $p_1 = p_2 = L$,
and there is no revelation of information.

2. If $L/H < \mu \le \frac{L}{H}\left(2 - \frac{L}{H}\right)$, then we get the "No Revelation" equilibrium:

$p_1 = L$, $p_2 = H$.

• The H type buys in both periods.

• The L type buys only in the first period.

3. If $\mu \ge \frac{L}{H}\left(2 - \frac{L}{H}\right)$, then we get the "Partial Revelation" equilibrium:

$p_1 = p_2 = H$.

• The H type buys in the first period with probability $\rho^* = \frac{\mu H - L}{\mu(H - L)} < 1$, and
buys for sure in the second period.

• The L type never buys.
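
The two summaries for short-term contracts can be combined into a small classifier. This is a sketch only: the function name `st_regime` and the returned labels are hypothetical, ties at a cutoff are resolved arbitrarily in favor of the first regime, and the boundary $\delta = 1$ (not covered by either summary) is grouped with $\delta > 1$ as an assumption.

```python
from fractions import Fraction

def st_regime(mu, L, H, delta):
    """Principal-preferred continuation equilibrium under short-term contracts."""
    ratio = Fraction(L) / Fraction(H)
    if mu <= ratio:
        return "full commitment optimum"
    if delta < 1:
        # Full Revelation is feasible; compare it with Partial Revelation.
        cutoff = ratio * (H + delta * (H - L)) / (L + delta * (H - L))
        return "Full Revelation" if mu <= cutoff else "Partial Revelation"
    # delta >= 1 (assumption: delta = 1 treated like delta > 1):
    # compare No Revelation with Partial Revelation.
    cutoff = ratio * (2 - ratio)
    return "No Revelation" if mu <= cutoff else "Partial Revelation"

L, H = 1, 2                                        # assumed illustrative values
for mu in (Fraction(1, 4), Fraction(7, 10), Fraction(9, 10)):
    for delta in (Fraction(1, 2), Fraction(3, 2)):
        print(f"mu={mu}, delta={delta}: {st_regime(mu, L, H, delta)}")
```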

Long Term Renegotiable Contracts

Now consider the case where the principal can sign a long-term contract with the agent, 
but cannot commit not to renegotiate the contract. The game between the principal and 
the agent can then be described as follows: 

• Principal offers a contract $\langle p_1, p_2(0), p_2(1) \rangle$

• Agent chooses $x_1$

• Principal offers $p_2'$

• Agent accepts $p_2'$ or rejects and sticks to the original contract

• Agent chooses $x_2$

We will look for the principal's preferred PBE of this game. Note that a renegotiation
offer $p_2'$ following the agent's choice $x_1$ will be accepted by the agent if and only if
$p_2' \le p_2(x_1)$. Therefore, a long-term renegotiable contract commits the principal against
raising the price, but does not commit her against lowering the price.

Analysis is simplified by observing that the principal can without loss offer a contract 
that is not renegotiated in equilibrium. 

Definition 16.2.1 A Renegotiation-Proof (RNP) contract is one that is not renegotiated
in the continuation equilibrium.

Proposition 16.2.1 (Renegotiation-Proofness Principle) For any PBE outcome of
the game, there exists another PBE that implements the same outcome and in which the
principal offers a RNP contract.

Proof. If the principal offers $\langle p_1, p_2(0), p_2(1) \rangle$ and in equilibrium renegotiates to
$p_2'(x_1) \le p_2(x_1)$ after the agent's choice of $x_1 \in \{0, 1\}$, then the contract
$\langle p_1, p_2'(0), p_2'(1) \rangle$ is RNP.

This result is quite general. The general idea is that since the process of renegotiation is
a game (mechanism), we can think of the "grand mechanism" consisting of the contract and
the subsequent renegotiation game. The "grand mechanism" itself is Renegotiation-Proof
and implements the same outcome as the original contract followed by renegotiation.

Once we restrict attention to RNP mechanisms, it is clear that the principal's inability
to commit not to renegotiate imposes an additional Renegotiation-Proofness constraint
on the set of implementable outcomes (in addition to IC and IR constraints). That is,
the principal cannot do better, and often does worse, than with full commitment.

Example 16.2.1 When $\mu > L/H$, with full commitment the principal offers $p_1 = p_2(0) =
p_2(1) = H$, and the agent chooses $x_1(H) = x_2(H) = 1$, $x_1(L) = x_2(L) = 0$. However,
this outcome is not RNP. Indeed, since $\mu_2(0) = 0$, following $x_1 = 0$ the principal will
offer $p_2' = L$, and the agent will accept. Thus, the RNP constraint reduces the principal's
payoff.
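
The failure of renegotiation-proofness in this example can be illustrated with a small check. This is a sketch under stated assumptions, not the notes' own construction: the names `revenue` and `is_renegotiation_proof` are hypothetical, an indifferent agent is assumed to buy, and the check relies on the observation that with two valuations the only price cuts worth considering are cuts to $L$ or to $H$.

```python
from fractions import Fraction

def revenue(price, mu2, L, H):
    """Second-period expected revenue at a price under belief mu2,
    assuming an indifferent agent buys."""
    if price <= L:
        return Fraction(price)           # both types buy
    if price <= H:
        return Fraction(mu2) * price     # only the high type buys
    return Fraction(0)                   # nobody buys

def is_renegotiation_proof(p2, mu2, L, H):
    """True if no price cut below the contracted p2 raises revenue.

    The agent accepts only offers at or below p2, so the principal can
    only lower the price; the candidate cuts are L and H.
    """
    cuts = [p for p in (L, H) if p < p2]
    return all(revenue(p2, mu2, L, H) >= revenue(p, mu2, L, H) for p in cuts)

L, H = 1, 2                              # assumed illustrative values
# Full-commitment contract after x1 = 0: belief 0, contracted price H.
assert not is_renegotiation_proof(H, 0, L, H)    # principal would cut to L
# After x1 = 1: belief 1, contracted price H.
assert is_renegotiation_proof(H, 1, L, H)        # no profitable cut
```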

The next observation is that the principal is at least as well off with LT renegotiable
contracts as with ST contracts. Indeed, in the former case one possibility for the principal is
to offer an ex ante contract specifying the default outcome of the second period. In our
example, this corresponds to $p_2(0) = p_2(1) = \infty$. With such a default price the agent accepts
any second-period offer, so this will implement the ST contracting outcome. Thus we have

Proposition 16.2.2 If an outcome is implementable with ST contracting then it is also
implementable with a LT Renegotiable contract (and therefore, by the RNP Principle, with
a LT RNP contract).

In our example, the intuition is that with LT renegotiable contracts the principal 
cannot commit against reducing the 2nd-period price, while with ST contracting she 
cannot commit against any modification of the 2nd-period price. 

When do long-term renegotiable contracts offer an advantage relative to short-term
contracts? When the ability to commit to a low price is useful. For example, consider the
possibility of full revelation in the first period when $\delta > 1$. With short-term contracts,
full revelation was impossible: because of the ratchet effect, the high type would only buy
in the first period at a price $p_1 < L$, but at this price the low type would also buy ("take
the money and run").

It turns out, however, that the principal can achieve full revelation with a LT RNP
contract $\langle p_1, p_2(0), p_2(1) \rangle$. In particular, consider the following contract:

$$
\begin{aligned}
p_1 &= H + \delta L, \\
p_2(1) &= 0, \\
p_2(0) &= L.
\end{aligned}
$$

This contract will have a full-revelation equilibrium. Indeed, ($IC_{HL}$) is satisfied: $H$'s
surplus is $\delta(H - L)$ regardless of whether he buys or not. ($IC_{LH}$) is also satisfied: $L$'s
surplus would be negative if he bought. Intuitively, unlike in the ST contracting case,
both incentive constraints can now be satisfied by committing against ratcheting, i.e., by setting
$p_2(1) = 0$. This makes the "take the money and run" strategy costly for the low
type: he now has to pay for the 2nd-period consumption as well. He will not be willing
to pay price $H + \delta L$ for both periods' consumption, while the high type will pay it. The
contract is also RNP, since $p_2(x_1) \le L$ for all $x_1$.
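
Both incentive constraints for this contract can be confirmed by direct computation. The sketch below uses assumed values ($L = 1$, $H = 2$, $\delta = 3/2$); the function `agent_surplus` and its argument names are hypothetical, and the agent is assumed to buy in the second period whenever the contracted price does not exceed his valuation.

```python
from fractions import Fraction

def agent_surplus(theta, buy_first, p1, p2_after_buy, p2_after_skip, delta):
    """Two-period surplus of a type-theta agent under a long-term contract."""
    first = (theta - p1) if buy_first else 0
    p2 = p2_after_buy if buy_first else p2_after_skip
    second = max(theta - p2, 0)          # buys in period 2 only if worthwhile
    return first + delta * second

L, H, delta = 1, 2, Fraction(3, 2)       # assumed values with delta > 1
contract = dict(p1=H + delta * L, p2_after_buy=0, p2_after_skip=L, delta=delta)

# High type: indifferent between buying and not, with surplus delta * (H - L).
assert agent_surplus(H, True, **contract) == delta * (H - L)
assert agent_surplus(H, False, **contract) == delta * (H - L)

# Low type: buying in the first period yields L - H < 0, so he stays out.
assert agent_surplus(L, True, **contract) == L - H
assert agent_surplus(L, False, **contract) == 0
```

The high type's indifference is what allows the first-period price $H + \delta L$ to extract all surplus beyond the rent $\delta(H - L)$ he could secure by waiting.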

Hart and Tirole (1988) reinterpret the model as describing contracting about the use
of a durable good. A short-term contract in this setting is a rental contract. The long-term
contract with $p_2(1) = 0$ can be interpreted as a sales contract. Thus, by using
sales contracts instead of rental contracts, the seller eliminates the ratchet effect, making
full revelation of information possible (but not always optimal). The renegotiation effect
remains.

More generally, Hart and Tirole consider a multi-period model. They show that long-term
renegotiable rental contracts give rise to the same outcome as sales contracts. The
resulting price follows Coasian dynamics: it decreases over time and eventually drops to
$L$. When the number of periods is large and $\delta$ is close to 1, the drop is almost immediate
in real time (cf. Gul, Sonnenschein, and Wilson 1986; Coase 1972).

Reference 

Bolton, P. and M. Dewatripont (2005), Contract Theory, MIT Press. 

Mas-Colell, A., M. D. Whinston, and J. Green, Microeconomic, Oxford University Press, 
1995, Chapter 14. 

Segal, I. (2007), Lecture Notes in Contract Theory, Stanford University. 

Wolfstetter, E., Topics in Microeconomics - Industrial Organization, Auctions, and In- 
centives, Cambridge Press, 1999, Chapters 8-10. 






